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Preface to the 2nd Edition 


Modelling Financial Time Series was first published by John Wiley and 
Sons in 1986. The first edition appeared when relatively few people were 
engaged in research into financial market prices. In the 1980s, it required a 
considerable effort to obtain long time series of prices, while further effort 
was required to write computer programs that could test hypotheses about 
price behaviour. Furthermore, the potential for modelling and forecasting 
volatility was only appreciated by a handful of researchers. All this has 
changed during the last 20 years — now there are a considerable number of 
empirical finance researchers, price data are freely available on the internet, 
financial software is widely available and the implications of stochastic 
volatility for risk managers and traders are generally understood. 

The first edition made a particular contribution to the development of 
financial econometrics, by describing the empirical characteristics of market 
prices and several models that could explain the early empirical evidence. 
Some of the people cited in the first edition have continued to expand our 
understanding of the dynamic properties of asset prices, most notably Robert 
Engle, Clive Granger and George Tauchen. Important new ideas subsequently 
flowed from a talented generation of researchers, many of whom are named 
in the additional references at the end of this preface. | am pleased to 
acknowledge these contributions, of which several have been made by Tim 
Bollerslev and Neil Shephard. 

The remainder of this preface summarises both the innovative material 
included in the first edition and the related key innovations in more recent 
years. A detailed, contemporary presentation of models for financial time 
series can be found in Taylor (2005). 

Chapter 2 documents the statistical features of daily asset returns. The 
three most important features are summarised on page 58: daily returns are 
there described as having firstly low autocorrelations, secondly non-normal 
distributions and thirdly a non-linear, generating process. The third feature 
was deduced by observing that there is "substantially more correlation 
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between absolute or squared returns than there is between the returns 
themselves” (p. 55). This third “stylized fact' seems to have been generally 
unknown when my book was published, in contrast with the first and second 
facts which were well-known from Fama's (1965) paper and earlier studies. 
Some people refer to the third fact as the "Taylor effect' and, additionally, 
observe that absolute returns seem to have higher correlations than squared 
returns (from Tables 2.11 and 2.12). The third fact is important because it 
shows that we need price models which do not assume that returns have 
independent and identical distributions. 

The non-normality of returns, their almost-zero autocorrelations and 
the positive dependence among absolute and squared returns remain the 
three most important 'stylized facts' about daily returns. A few studies have 
discussed the power 4 that maximises the autocorrelations of the time series 
{Ir41}, with r, denoting the return for day t in this preface. The results in Ding, 
Granger and Engle (1993), Granger and Ding (1995) and Taylor (2005) show 
that the maximum autocorrelation most often occurs when 4 is approximately 
one, i.e., for absolute returns. 

Chapter 3 provides reasons for changes in volatility, and it states and 
compares several stochastic volatility models. The prior work of Clark and 
Tauchen & Pitts had already related volatility to the number of intra-day 
price movements (or news events), as shown and discussed on page 71. By 
supposing these measure of market activity are correlated across trading days, 
| obtained explanations for both the stochastic behaviour of volatility and 
the correlation among absolute and squared returns. The simplest interesting 
model for volatility supposes that it follows a two-state Markov chain, as 
considered on page 67, but more volatility states are required to provide a 
realistic description of market prices (Taylor, 1999). 

Two of the volatility models described in Chapter 3 have been used in 
numerous empirical studies. The lognormal, autoregressive model for a latent 
volatility variable was first published in Taylor (1982) and it is included in 
Section 3.5 and introduced on page 73. It is often called the SV (stochastic 
volatility) model. In modern notation, it defines the return r, as its expectation 
4 plus the product of the volatility o; and an i.i.d. standard normal variable u, 


r= и + OU; 
with 
log(a) = a+ @(log(o,+) – Q) + 7. 
Because this model has two random components (u, and 7,) per unit time, it 


is impossible to exactly observe the realisations of the volatility process and 
relatively difficult to estimate the parameters of the АК(1) process for log(o;). 
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These technical problems have been overcome in papers such as Jacquier, 
Polson and Rossi (1994), Shephard (1996), Shephard and Pitt (1997) and 
Sandmann and Koopman (1998). 

Several extensions of the above SV model have been investigated. 
These include replacing the normal assumption for the variables u, by a 
Student-t distribution (e.g., Harvey, Ruiz and Shephard, 1994), introducing 
some dependence between u, and either 7, or 7,4 (e.g, Yu, 2005), and 
replacing the AR(1) specification for the logarithm of volatility by a long- 
memory process (e.g., Breidt, Crato and de Lima, 1998). Surveys of the SV 
and related models are provided by Shephard (1996) and Ghysels, Harvey 
and Renault (1996). The recent book by Shephard (2005) is a collection 
of important SV papers, which also contains an expert review of SV literature. 
Shephard's book provides access to the remarkable working paper 
of Rosenberg (1972), which anticipates both SV and ARCH models; 
unfortunately this paper was unknown to almost all scholars (including 
myself) until recently. 

The second volatility model described in Chapter 3 that has been used in 
numerous empirical studies is called an ARMACH model, on page 79 and 
defined by equation (3.6.7). It is now known as the GARCH(1,1) model, 
which is the name introduced in Bollerslev (1986). At about the same time, 
we independently defined and investigated this model, which is a natural 
development from the ARCH(p) model defined in Engle's path-breaking 1982 
paper. 

Again using modern notation, the original version of GARCH(1,1) 
defines the return r, as its expectation plus the product of the conditional 
standard deviation J/h, and an i.i.d. standard normal variable z, 


n-ut Ма 
with 
h,= @+ ar, - uy. + pha. 


Тһе great advantage of this model, compared with SV models, is that there 
is only one random component (z,) per unit time and hence, the likelihood 
function is easy to calculate as the product of conditional densities (p. 90). 
The availability of maximum likelihood estimates explains why ARCH 
models are more popular than SV models in the research literature. 

When | wrote the first edition, | preferred SV models to my ARMACH 
models because | did not like the assumption that returns up to time t— 1 
then contain all the information needed to correctly identify a conditional 
normal distribution for the return during period t. This assumption does not 
permit unexpected news on day t to partially determine the variance of the 
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conditional normal distribution, and it is also contradicted by the empirical 
forecasting evidence presented in Chapter 4 (see p. 107). | soon appreciated, 
however, that my arguments against ARCH do not apply to the more 
general specifications of Engle and Bollerslev (1986) and Bollerslev (1987) 
that permitted the z, to have a fat-tailed distribution; their Student-t choice is 
a mixture of normal distributions that can be motivated by uncertainty at time 
t — 1 about the amount of relevant news on day t. There are many similarities 
between SV models and general ARCH models, which are surveyed in Taylor 
(19942). It is very difficult, and possibly of minimal value, to say which of 
these two types of models provides the best description of market returns. 

A general ARCH model has a general specification for the conditional 
variance h, and a very considerable number of specifications have been 
suggested. In particular, the general GARCH(p,q) model of Bollerslev (1986) 
makes h, a linear function of the p previous squared excess returns and the 
qterms h;u, 1 < i < q. 

We should note that the signs of the previous residuals, r;— 4, аге 
irrelevant in the GARCH model. Nelson (1991) made a very important 
contribution to volatility modelling by demonstrating that the symmetric 
treatment of positive and negative residuals is not appropriate for U.S. index 
returns. The estimates for his exponential GARCH model show that h, 
is higher when the most recent residual is negative than when it has the 
same magnitude and is positive. Many people have subsequently evaluated 
asymmetric ARCH models, with one of the most popular being the following 
simple extension of GARCH(1,1) defined by Glosten, Jagannathan and 
Runkle (1993): 


h= @+ (a + S aora Д? + Bh. 


with S1 = 1 if r < “and otherwise S. = 

Other notable ARCH variations include working with linear functions 
of absolute values rather than squares (Chapter 3 and Schwert, 1989), 
making the conditional mean a function of h, (Engle, Lilien and Robins, 
1987), and specifying a long-memory process for h, (Baillie, Bollerslev 
and Mikkelsen, 1996). The survey by Bollerslev, Chou and Kroner (1992) 
covers a considerable number of applications, while the subsequent survey 
by Bollerslev, Engle and Nelson (1994) is more theoretical but also contains 
detailed examples of the specification of conditional densities for daily 
returns from U.S. indices, going back as far as 1885. 

Chapter 4 compares some methods for estimating and forecasting volatility. 
Inferences about volatility forecasting methods are obtained indirectly by 
comparing methods for forecasting absolute returns. It is recommended that 
exponentially weighted moving averages of absolute returns are used to 
measure volatility, as they are robust against changes in the long-run level 
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of volatility and are also empirically as accurate as alternative methods. 
Volatility forecasts are of particular interest to risk managers and option 
traders, and a vast literature now compares forecasting methods; for a recent 
survey see Poon and Granger (2003). The best forecasting methods rely upon 
volatility levels implied by option prices and upon high-frequency returns, 
typically by using measures such as the daily sum of squared five-minute 
returns (e.g., Blair, Poon and Taylor, 2001). 

Chapter 5 evaluates the accuracy of autocorrelations estimated from 
returns. The classical standard error formula is Vn for a sample of n 
observations from independent and identical distributions. This formula fails 
for the uncorrelated processes introduced in Chapter 3; a suitable formula is 
b] Jo , with b » 1 estimated from the observed data. The formula evaluated 
comes from my 1984 paper and it is also derived (using different 
assumptions) in Lo and МасКіпіау (1988). Rescaled returns, defined as 
returns minus their average divided by an estimated conditional standard 
deviation, are shown to have autocorrelations whose sample variances are 
much closer to the classical result. 

Chapter 6 presents methods and results for tests of the random walk 
hypothesis. One of the test statistics is derived in one of my 1982 papers and 
it is powerful when the alternative to randomness is trends in the price 
process. It is argued that any trends will create small positive autocorrelation 
among returns, that declines as the lag increases. An ARMA(1,1) model is 
the simplest that has these autocorrelation properties. The appropriate test 
statistic is then proportional to 2261 >, , with фа suitable AR parameter value, 
P, the sample autocorrelation for lag тапа the sum taken over the first k lags. 
The trend test is shown to reject the random walk hypothesis for commodity 
and currency returns, for choices of ø and k that have high test power 
for plausible trend parameters. This conclusion does not hold, however, for 
the tested equity series, that appear instead to be best described by МАП) 
processes. 

Results from the trend test have only been reported in a few papers, such 
as Kariya et al. (1995). Many papers have instead employed the variance- 
ratio test statistic of Lo and MacKinlay (1988). This test compares the variance 
of k-day returns with k multiplied by the variance of one-day returns. It is 
equivalent to basing the test upon the linear combination 2k – z), , with 
the sum over the first k— 1 lags. As both my trend test and the variance-ratio 
test are based upon sums Zw, д,, with и > w, > мз > + > 0, they are both 
expected to be powerful for the same alternatives to the random walk 
hypothesis. These alternatives include the ARMA(1,1) specifications provided 
by my price-trend models, and then all the dependence is positive. They also 
include the models of Fama and French (1988) and Poterba and Summers 
(1988) that assert market prices revert towards rational levels that follow a 
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random walk, and then all the dependence is negative. Several further test 
statistics invented after 1986 are reviewed in Taylor (2005). 

Chapter 7 provides explicit statistical models for trends in prices which 
explain a small proportion of the total variation in daily returns. The trend 
parameters are estimated and then used to find optimal forecasts of future 
returns, which are based upon theoretical results for ARMA(1,1) models. 
While these forecasts are only marginally more accurate than random walk 
forecasts, they are of interest because they can be employed in trading rules 
that are potentially profitable as is shown in the next chapter. 

Chapter 8 presents some evidence that trading rules could have earned 
excess returns at futures markets, by seeking to identify and follow trends. 
The theoretical analysis commencing on page 204 shows that small amounts 
of positive dependence among returns can be exploited profitably, providing 
the total dependence across all lags is sufficiently high and transaction 
costs are sufficiently low. Subsequent Monte Carlo analysis of hypothetical 
currency futures prices in Taylor (1994b), traded using the channel trading 
rule endorsed by Lukac, Brorsen and Irwin (1988), shows that annual 
excess returns of 496—596 can be earned when the autocorrelations are 
Pr  0.02(0.95*) and the total cost of opening and later closing an unlevered 
futures position equals 0.2% of a trader's capital. 

Chapter 8 also shows that statistical trend-following trading rules were 
profitable in the late 1970s and early 1980s. Several other researchers 
have shown that simple trend-following rules applied to exchange rates 
would have provided trading profits throughout the 1970s and 19805 (e.g., 
Sweeney, 1986; Kho, 1996; LeBaron, 1999), but these opportunities seem to 
have disappeared in the 1990s (Olson, 2004). The most popular explanations 
for the profitable results refer to either a time-varying risk premium or to 
central bank intervention. 

Chapter 9 suggests methods for valuing options when the volatility is 
stochastic and/or there are trends in prices. These methods only provide 
valuations that are approximately correct. The general theory for pricing 
options in a stochastic volatility economy is presented in Hull and White 
(1987). Some special cases have closed-form valuation equations. In 
particular, the remarkable paper by Heston (1993) shows how to value 
options when the continuous-time process for volatility is a square-root 
process; his results are generalised by Duffie, Pan and Singleton (2000) to 
also allow for jumps in the price process. 

The first edition only covers time series of prices recorded once a day 
or less often. The most important development since then is the empirical 
analysis of high-frequency prices, typically recorded every five minutes. 
High-frequency returns display the same stylized facts as daily returns, but 
additionally they also have substantial variations in the average level of 
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volatility during the day. Some of this variation can be explained by 
macroeconomic news announcements. 

volatility can be estimated, modelled and predicted more accurately by 
using high-frequency returns. Much recent research has focused on the 
information conveyed by realised volatility, which is defined as the square 
root of the sum of the squares of intraday returns. Realised volatility has been 
found to have an approximately lognormal distribution (which matches the 
distributional proposal made in Chapter 3) and a long-memory property 
(Andersen et al, 2001a, 2001b). Furthermore, the distribution of daily 
returns conditional upon realised volatility is approximately normal so the 
unconditional distribution is approximately the lognormal-normal distribution 
of Clark mentioned on page 48. Finally, one of the most interesting subjects 
of current research is the detection of jumps in the price process, using 
methods pioneered by Barndorff-Nielsen and Shephard (2006). 
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Preface to the 1st Edition 


The prices of stocks, commodities and currencies are for ever changing. 
Anyone interested in financial prices soon discovers that changes in prices 
are frequently substantial and are always difficult to forecast. This book 
describes the behaviour of prices from a statistical perspective. Prices 
recorded at regular intervals of time define a time series from which much 
can be learnt. In particular, past prices can be used to give insights into 
future price behaviour. Forecasts of future prices and of the volatility of 
future prices are especially interesting. 

Forty time series are investigated in this text. They include prices for 
stocks in New York and London, agricultural futures in Chicago, London, 
and Sydney, spot bullion and metal contracts in London, and currency 
futures in Chicago. These prices are used to construct statistical models 
and to explore the benefits from relevant forecasts. 

Although several books cover time series in general, following the 
methods established by Box and Jenkins, and a few books focus on 
economic time series, particularly the text by Granger and Newbold, this is 
the first book about financial time series since the pioneering works by 
Granger, Morgenstern, and Labys fifteen years ago. 

It will be shown that new models are needed because standard models 
incorporate assumptions which are not fulfilled by financial time series. 
Unfortunately researchers often assume or investigate unsatisfactory 
models. | hope this book will help researchers to use and develop better 
models. 

Readers will need to be familiar with statistical concepts and some prior 
study of time series methods is desirable, although not essential. Invest- 
ment analysts should find this book instructive even if they find the 
statistical material demanding. 

Chapters 1 and2 provide an introduction to financial markets, properties 
of day-to-day returns on investments in financial assets, previous research, 
and also time series definitions and models. Chapters 3 to 7 cover the 
construction of accurate models for daily returns. These models include 
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changes in variance corresponding to changes in price volatility. Certain 
models also include weak dependence between price changes as the 
familiar random walk hypothesis is shown to be false. Forecasts are 
discussed in some detail. Chapters 8 and 9 explore interesting implications 
of the models for traders of futures and options. An appendix gives a 
computer program for readers wanting to model their own series. 

Г am very grateful to the many people who have made this research 
possible. Special thanks are due to those who generously gave me price 
data or collected it for me, especially Lakshman Balasuriya, Carmelo 
Giaccotto, Rui Guimaraes, Brian Kingsman, Lester Madden, Ash Patel, 
Peter Praetz, and Harry Shaw. Monash University, Australia, provided 
excellent resources for me to write this book whilst on sabbatical leave 
from the University of Lancaster. Barry Goss and Peter Praetz kindly 
arranged my visit down-under. The manuscript was typed with remarkable 
fortitude by Anne Welsby at Lancaster and by Kathy Fullard and Julie 
Harder at Monash. Various tables and figures are reproduced here by kind 
permission of Basil Blackwell Ltd, Chapman and Hall Ltd, the Operational 
Research Society, the Royal Statistical Society, and the Western Finance 
Association. 


University of Lancaster Stephen Taylor 
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England 
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Introduction 


1.1 FINANCIAL TIME SERIES 


Financial prices are continually brought to our attention. Daily news 
reports on television and radio inform us, for example, of the latest stock 
market index values, currency exchange rates, and gold prices. The reports 
often highlight substantial changes in prices. Over a period of a few 
months prices can move up or down by several per cent from their original 
levels. The Dow Jones index was 777 during August 1982 and rose by over 
50 per cent to 1174 exactly one year later. The number of US dollars sold for 
one poundsterling increased from 1.11 in January 1985 to 1.44 in September 
1985 whilst the price of certain coffee futures fell from £2410 to £1674 during 
the same period. Many more examples of large price movements could be 
given. 

It is often desirable to monitor price behaviour frequently and to try to 
understand the probable development of prices in the future. Suppose 
you have planned a holiday abroad and will need to buy foreign currency. 
This purchase could be made months in advance or left until you arrive at 
your destination. You may well look up the latest exchange rates from time 
to time and try to forecast them. We call the series of prices thus obtained 
a financial time series. This title will be used for any series of numbers 
based upon financial prices and we will especially consider stock, currency 
and commodity prices. A clearer picture emerges when prices are recorded 
at regular intervals. We consider series for which the price is recorded 
once for every day that an appropriate market is open. 

The first objective of price studies is to understand how prices behave. 
This is such a complex subject that we cannot rely solely on theoretical 
explanations but must instead look in depth at actual prices. Statistical 
methods are the natural way to investigate prices and as computer technol- 
ogy advances we can learn more from longer price series using less 
computing time. Tomorrow's price is uncertain and it must therefore be 
described by a probability distribution. Many researchers have concen- 
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trated upon the problem of using past prices to estimate the average values 
of future distributions. This has usually produced the simple forecast that 
the best estimate of tomorrow's price is today's price. It is better to try to 
describe the entire probability distribution and this will be done in Chap- 
ters 2 to 7. The variance of the distribution is particularly relevant and we 
need to understand something about how it changes through time. 

A second objective of investigations into financial time series is to use 
our knowledge of price behaviour to take better decisions. Chapters 7 to 9 
explore practical applications of our price models. Improving on the 
simple price forecasts described above is difficult, but not always impos- 
sible. Decisions based upon better forecasts are profitable for trading 
currencies and commodities. The size of price-changes, ignoring their 
direction, can be predicted to some degree. This provides a way to 
anticipate and perhaps avoid the risk of a large adverse change in prices. 
Forecasts of the variances of future price-changes are very helpful for 
assessing prices at the relatively new option markets. 


1.22 ABOUT THIS STUDY 


Many studies of financial time series have been published. My study differs 
from previous work in four ways. Firstly, it covers a very broad selection of 
financial markets, listed later in Section 2.2. These markets are situated in 
New York, Chicago, London, and Sydney. They include several of the most 
important stock, currency, and commodity markets. Both spot and futures 
markets are considered in detail. Consequently, general results can be 
established without having to rely on UK markets, say, being like US 
markets or spot markets like futures markets. 

Financial time series do not conform with the usual requirements for 
orthodox time series analysis. The second innovation in this study is the 
development of suitable methods for analysing financial series. These 
methods have to overcome statistical problems caused by apparent 
changes in the variance of day-to-day price changes. Variance changes are 
rather elusive and the third feature of this book is a systematic investigation 
of variance models. 

There have been many arguments about the existence of price-trends 
but little conclusive evidence for them. This is partly because the trend idea 
is vague. My fourth contribution is a model-based approach to trends. This 
allows rigorous tests and uncovers quite a lot of evidence for trends in 
currency and commodity prices, albeit small ones. 

The remainder of this introductory chapter covers elementary informa- 
tion about financial markets and some models for time series. Major 
markets are described in Section 1.3 and a few examples of their price 
series are discussed in Section 1.4. A brief review of previous research into 
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financial time series follows as Section 1.5; many further studies are 
referred to in later chapters. 

Statistical methods are used throughout this text to describe prices and 
numbers obtained from prices. We will model day-to-day changes in the 
logarithms of prices. These changes are called returns and the reasons for 
analysing them are given in Section 1.6. Relevantcriteria for model building 
are described in Section 1.7, followed in Section 1.8 by a summary of how 
models are used in the later chapters. Our models are stochastic processes, 
which specify the multivariate distributions of returns in varying degrees of 
detail. An introduction to simple processes, including ARMA models, ts 
presented in Section 1.9. Most time series methods assume that the 
relevant stochastic process is linear, explained in Section 1.10. However, 
we willlater see that good models for financial returns must be non-linear. 

A short overview of the contents of Chapters 2 to 9 is given in Section 1.8 
after describing some necessary definitions and concepts in Sections 1.3 to 
1.7. 


1.3 THE WORLD'S MAJOR FINANCIAL MARKETS 


Stocks, commodities, currencies, and other goods are traded at financial 
markets and exchanges. There are many motives for trading. Some ex- 
amples are the reduction of business risks, purchases and sales of raw 
materials, and the investment of personal or corporate wealth. We con- 
sider those markets where prices change frequently and thus an invest- 
ment will produce an uncertain return. 

Market size can be measured by the value of the goods traded. Using this 
criterion, the largest markets are in New York, Chicago, and London. There 
are also important markets in Amsterdam, Frankfurt, Paris, Tokyo, Hong 
Kong, Sydney, Montreal, and many other cities. 

Various types of goods and investments are traded at financial markets. 
For example, suppose it is believed that gold prices will rise. Then action 
can be taken by trading spot bullion, futures contracts, or options. Spot, 
futures, and options markets offer a variety of trading possibilities. 

Spot markets are used for a straightforward and quick exchange of 
goods. The most interesting spot markets for investors are probably stock 
and share markets where all sorts of listed and unlisted securities are 
traded. Almost all the industrialized nations in North America, Western 
Europe, the Far East, and Australasia have active stock exchanges. Pro- 
ducers of primary commodities use spot markets to sell directly to con- 
sumers and manufacturers. 

Futures markets allow people to agree a price for exchanging goods at 
some later date, perhaps several months into the future. The buyer of a 
futures contract may sell it before the delivery date and thus avoid a 
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physical exchange of the goods bought. Likewise a seller can cancel an 
obligation to deliver goods by subsequently buying a contract. Many 
business risks can be reduced substantially by trading futures, an activity 
called hedging. Investors can back their predictions by trading futures, 
often called speculation, without having to own, transport or store physical 
goods. 

The Chicago Board of Trade and the Chicago Mercantile Exchange run 
very large futures markets and there are substanial markets in New York 
and London. Agricultural and mineral futures have been traded for several 
decades. These include markets for crops, animals, and metals, and very 
recently petroleum products. Some important examples are corn, cattle, 
gold, copper, and heating oil. Financial futures have been traded at 
Chicago since the early 1970s, and now have a dominant position in the 
futures industry. Transactions in Sterling, Deutschmarks, Swiss francs, 
Yen, Treasury bills and bonds have particularly high trading volume. In 
1982, futures for stock indices commenced trading in the US and a financial 
futures market was opened in London. 

An option gives its owner the right to engage in a particular spot or 
futures transaction at a formerly agreed price. The right does not have to be 
exercised; it is optional. Options can be traded on stocks, stock index 
futures, currency and gold futures, and other goods, at markets in Chicago, 
London, and Amsterdam. 

Spot and futures prices are analysed in great detail in this book. Option 
pricing is discussed but prices are not examined empirically. An under- 
standing of the behaviour of the associated spot or futures prices is 
essential for valuing an option. The empirical results for spot and futures 
prices have several implications for effective option trading. 

We will consider prices and only occasionally discuss institutional 
arrangements. Further information about the organization of financial 
markets and procedures for trading can be found in the short yet com- 
prehensive text by Geisst (1982), in Sharpe (1981), and in the books on 
stocks by Stonham (1982), on commodity futures by Teweles et al. (1974) 
and Granger (1979), and on options by Bookstaber (1981) and Cox and 
Rubinstein (1985). 


1.4 EXAMPLES OF DAILY PRICE SERIES 


Figure 1.1 shows nearly 2000 consecutive daily gold prices, covering the 
seven and a half years from April 1975 to December 1982. In the first half of 
this series, the spot price begins at about $180, falls to just over $100 during 
the following 18 months and then increases steadily to about $200. The 
second half of the series shows the price of gold rising sharply from $200 to 
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Figure1.1 Gold prices, in US dollars per ounce, daily from April 1975 to December 
1982 


a peak of $835 in just one dramatic year, followed by various fluctuations 
between $500 and $750 for the next year and a half, then a sustained fall to 
$300 and finally a recovery to $450. The highest price is more than eight 
times the lowest price, notan untypical range for a series of 2000 commod- 
ity prices. 

Figure 1.2 illustrates the result of investing $1 in the stock of General 
Motors at the New York Stock Exchange. This series covers the 11 year 
period from January 1966 to December 1976. Dividends have been re- 
invested and trading costs have been ignored. The investment shows a loss 
for the first four years and recovers to a breakeven position after seven 
years. Half the investment is then lost in the next two years. Fortunately the 
remaining 50 cents grow to more than $1.40 in the final two years. The 
highest value is nearly three times the lowest. 

Currency prices are relatively stable compared with stock and commod- 
ity prices. Figure 1.3 shows 2000 values of the dollar-sterling exchange rate 
between November 1974 and September 1982. The highest rate is just over 
one and a half times the lowest rate. The long decline at the beginning of 


6 Modelling tinancial time series 


Value 


0.0 -- -- | 1 = pm. | 
0 500 1000 1500 2000 2500 
250 750 1250 1750 2250 2750 


Time іп trading дау units 


Figure1.2 Value of $1 invested in General Motors stock on 1 January 1966, daily to 
December 1976 


the series from $2.40 to $1.60 lasts for two years. Then the rate climbs back 
to $2.40 during the following four years, although there are severaltempor- 
ary reversals. After regaining $2.40 there is a sudden collapse to $1.80, a 
slight recovery and then a final fall to $1.70. 

These first three figures show long-term substantial changes in prices. 
There are also major price movements within shorter periods of time. 
Figure 1.4 shows this for two sugar futures contracts. A year of prices are 
plotted for both the December 1963 and December 1966 contracts, traded 
at London in sterling units. Within 1963, the sugar price went from £33 to 
£97, fell to £40 and then surpassed its earlier high and went on to reach £102. 

All the figures demonstrate that there are frequent large and important 
changes in financial prices. Figure 1.4 also shows that there are large 
changes in the volatility of prices—prices changed very quickly in 1963 but 
changed much slower in 1966. Much of this book is about improving 
forecasts of future prices and the size of future price-changes. Satisfactory 
forecasts of these quantities are clearly very important. 


Introduction 7 


267 


Dollars per pound 


1.4 | | 1 l | 1 | | 
О 250 500 750 1000 1250 1500 1750 2000 


Time in trading day units 
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Figure 1.4 Prices in pounds per tonne for two December sugar futures. Repro- 
duced from Taylor and Kingsman (1978) by permission of the Operational Research 
Society Ltd 
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1.5 A SELECTIVE REVIEW OF PREVIOUS RESEARCH 


Important questions 


Graphs of financial prices, plotted against time, frequently appear to show 
trends in prices. An upward trend occurs when prices generally continue 
to rise for several days. A downward trend describes falling prices. The 
preceding figures certainly appear to display trends. Throughout this 
century, brokers, investment managers and individual speculators have 
plotted price pictures, tried to predict trends and then traded or persuaded 
others to do so. A moment's reflection shows us that for everyone who 
gains money by trading someone must have lost it. Therefore predicting 
prices cannot be easy. Further thought may cause us to believe that if one 
person can deduce a good trading system then other people will eventually 
discover it. Then the system will be used so much that eventually its profit 
potential will disappear. 

Accurate price forecasts and easy trading gains cannot and should notbe 
expected. Research into price forecasting and trading methods has been 
very extensive and usually based on detailed empirical analysis of prices 
and further information. Much of this research has attempted to answer 
two questions: 


(1) Is the most accurate forecast of tomorrow's price simply today's price 
plus an estimate of the long-run average daily price change? 

(2) Can profits be made by frequently changing a market position, buying 
and selling the same goods many times? 


These questions define two of the most important hypotheses for price 
research, which are now reviewed briefly and later assessed empirically. 


The random walk hypothesis 


All forecasts are based upon information. If we want to forecast tomorrow's 
price for a stock, say, then we might look at today's price, the prices on 
several previous days, and perhaps additional information. A plausible 
forecast is today's price because many traders will be (at least) reasonably 
up-to-date in their assessment of information about the stock. Over along 
period of time it will be noticed that stock prices have a long-run tendency 
to rise. Our plausible forecast could then be revised to be today's price plus 
the long-run average daily price change. Other forecasts might use today's 
price and a number of previous prices. 

The usual statistical criterion for assessing forecasts is their mean square 
error. This is the average value of the square of the difference between the 
forecast made and the actual value observed. The random walk hypothesis 
is a statement that price changes are in some way random and so prices 
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wander (“walk”) in an entirely unpredictable way. Consequently, forecasts 
based on today's price cannot be improved by also using the information 
in previous prices. Alternatively, the mean square error cannot be reduced 
by using previous prices. 

There are many ways to phrase the random walk hypothesis in statistical 
terms. Anumber of definitions have been published. In every case the best 
forecast of tomorrow's price requires today's price but not previous prices. 
Bachelier (1900) in a most remarkable thesis implied that price changes 
have independent and identical normal distributions. Fama (1965) removed 
the assumption of normal distributions. The hypothesis is then that price 
changes are independent and have identical distributions. Granger and 
Morgenstern (1970, рр. 71-3) do not require the price changes to be 
identically distributed. We follow their convention in this book. The 
random walk hypothesis is defined by constant expected price changes 
and zero correlation between the price changes for any pair of different 
days. Zero correlation will be sufficient, for practical purposes, to ensure 
that out-of-date prices are irrelevant when forecasting. 

Empirical tests of the random walk hypothesis have been published for 
nearly all the world's major financial markets. The earliest studies include 
important investigations by Working (1934), Kendall (1953), and Fama 
(1965). Working showed that several series of commodity futures prices 
strongly resembled an artificial series obtained by simulating a random 
walk. Kendall, after analysing wheat prices, cotton prices, and share 
indices, concluded that investors ought to assume that prices followed 
random walks. Fama studied the prices of all 30 stocks in the Dow Jones 
Industrial Average index in considerable detail. His results show that US 
stock prices either follow random walks or something very similar. 

Fama's paper rightly had asignificant impact on academic research. After 
1965, many researchers assumed that prices followed random walks and 
then sought answers to other questions about optimal investment de- 
cisions. Tests of the random walk hypothesis continued to be done with 
conclusions that tended to agree with Fama's. Prices at large stock markets 
appeared to be random walks. However, at smaller stock markets and at 
commodity futures markets slight deviations from random behaviour were 
often claimed. 

There have been so many random walk investigations that itis impractical 
to mention all the important ones. The following lists certainly contain 
many of the most interesting books and articles. 


(a) Stocks 
in the USA: Fama (1965) and Granger and Morgenstern 
(1970) 
in the UK: Dryden (1970a) and Cunningham (1973) 
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in Scandinavia: Jennergren and Korsvold (1974) and Jennergren 
and Toft-Nielsen (1977) 
in Australia: Praetz (1969) 


and in many countries: Cooper (1982). 
(b) Commodity futures 


in the USA: Labys and Granger (1970), Dusak (1973), and 
Cargill and Rausser (1975) 

in the UK: Taylor (1980) 

and in Australia: Praetz (1975). 


(c) Currency exchange rates: Cornell and Dietrich (1978) and Levich (1979). 


As will become clear in Chapters 2 to 5 inclusive, the statistical methodol- 
ogy used for random walk tests has often been inappropriate and some 
reported conclusions are questionable. New methods are presented and 
the random walk hypothesis is tested for several US and UK markets. 


The efficient market hypothesis 


When prices follow a random walk the only relevant information in the 
series of present and past prices, for traders, is the most recent price. Thus 
the people involved in the market have already made perfect use of the 
information in past prices. Suppose prices are available very frequently. 
Then if only the latest price is relevant it follows that prices very quickly 
reflect the information in the historical record of prices. A market will be 
called perfectly efficient if the prices fully reflect available information, so 
that prices adjust fully and instantaneously when new information 
becomes available (Fama, 1976, p. 140). 

In many economists' conception of an ideal world, financial markets 
would be perfectly efficient. Real markets do not use information perfectly. 
This statement will be justified later. However, many markets are so good 
at processing information that traders certainly ought not to be using the 
information to run trading systems. A trading system is simply some 
systematic method for repeatedly buying and selling the same goods. 
Information which is useless for trading systems can still be helpful. For 
example, it can be used to optimally design a diversified portfolio for 
long-term investment. 

A market for some asset will be called efficient if the results obtained by 
using certain information to trade are not better than the results obtained 
by using the information to help decide the optimal quantity (if any) of the 
asset in a static portfolio (cf. Jensen, 1978, p. 96). Results are measured by 
the risk-adjusted return net of all costs; costs include commission, taxes 
апа апу payments for acquiring information. Risk adjustments are essential 
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to ensure that trading strategies are compared with equally risky alternative 
investments. In this book we will only consider the information available in 
the present price and past prices. If the market is efficient with respect to 
this information we say that the efficient market hypothesis is true, 
otherwise the hypothesis is false. 

To be more precise, these statements refer to the so-called weak form of 
the hypothesis. This is the only version of the hypothesis investigated in 
the text. Expanding the set of information to include all relevant publicly 
available information about the asset leads to a semi-strong form of the 
hypothesis. Including private information gives the strong form. Further 
details and reviews of relevant research appear in Fama (1970, 1976) and 
Jensen (1978). Grossman and Stiglitz (1980) present persuasive theoretical 
arguments against perfect efficiency. Their model is approximately com- 
patible with semi-strong efficiency in the manner discussed here: 
informed traders must be compensated for acquiring costly information by 
prices revealing such information slowly. We discuss definitions of an 
efficient market again in Section 8.2. 

Many people have used trading systems to investigate the (weak form) 
efficient market hypothesis. Some researchers have rejected the random 
walk hypothesis and have then wanted to know if the departures from 
random behaviour can be exploited. Other people have not trusted 
statistical tests to find evidence for price patterns and have preferred to 
look at trading systems instead. 

The most popular trading rule in academic research is the filter rule of 
Alexander (1961). This rule assumes there are trends in prices. The inten- 
tion is to buy when it is believed an upward trend has already begun and 
then to sell as soon as there is sufficient evidence that a downward trend 
has commenced. Alexander's rule can begin by supposing an asset is 
bought on day number í. It is then sold on the first day j for which the price 
is x percent less than the highest price between days i and j — 1 inclusive. 
The trader also goes short if possible on day j. When the price is x percent 
more than the least price on or after day j then the asset is once again 
bought. The parameter x is usually assessed within the range 0.5 to 25. 

Filter trading results depend on the type of market in much the same way 
as the results of random walk tests. Gross trading profits are generally less 
than the costs of trading for stock markets. The opposite conclusion has, 
however, been claimed for certain commodity futures markets. Some of 
the more interesting studies are the following. 


(a) Stocks 
in the USA: Fama and Blume (1966) 
in the UK: Dryden (1970b) 


and in Sweden: Jennergren (1975). 
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(b) Commodity futures 


in the USA: Stevenson and Bear (1970) and Leuthold (1972) 
and in Australia: Praetz (1975). 

(C) Metals 
in the UK: Bird (1985). 


(d) Currency exchange rates: Cornell and Dietrich (1978). 


Praetz (1976a, b) and others have shown that the methodology frequently 
used to assess trading rules is inadequate. Appropriate methodology and a 
critical review of previous results are presented in Chapter 8. New trading 
rules and empirical results are also presented for commodity and currency 
futures markets. 


1.6 DAILY RETURNS 


Direct statistical analysis of financial prices is difficult, because consecutive 
prices are highly correlated and the variances of prices increase with time. 
Prices are not 'stationary', a concept introduced in Section 1.9. Con- 
sequently, it is more convenient to analyse changes in prices. Results for 
changes, for example a forecast, can easily be used to give appropriate 
results for prices. 

Suppose the price is recorded once on each trading day, always at the 
same time of day. Closing prices are usually recorded. Let z, be the price on 
trading day t and let d; be the dividend (if any) paid during day t; d, will only 
be non-zero for stocks and then only on a few days every year. Three types 
of price changes have been used in previous research: 


xt = Z + d, — 2-1, 
x, = log (z, + d,) — log (z,_-,), and 
x, = (z, + d, — z,_()/z,_1. 


The first differences x? depend on the price units, so comparisons between 
series are difficult. They have the further disadvantage that their variances 
are proportional to the price level. For these reasons, either the x, or the x; 
are nearly always studied in modern research. 

One unit of money, invested іп 1/2, , items at the price 2, 1,15 worth 


(z^dYz ,214xp2e*-1-c4xtihdit. (1.6.1) 


when the price z, is recorded on day t. Thus x; is the one day rate of return 
obtained on the investment, whilst x, is the rate of return with continuous 
compounding; for exp (x) is the limit of (1 + x;/n)" as n о. 

We see from (1.6.1) that x, and x; are almost equal, since they are nearly 
always within the range —0.1 to 0.1. 

Some researchers prefer to study the compound return x, others the 
simple return x;, and those who have studied both find that the important 
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conclusions are the same for each type of return. In this book we investi- 
gate compound returns. The adjective 'compound' will only be used when 
it is particularly helpful, so generally x, is called the return on day t. 

There are two major reasons for preferring the compound definition. 
Firstly, continuous time generalizations of discrete time results are then 
easier and secondly returns over more than one day are simple functions 
of single day returns. For example, at the end of day t 1 the original unit 
of money is worth 


ехе = (1 + x) xia). 
Ignoring dividends, the two-day returns are 

X12 = log (2,44) — log (2-1) = X, + X44 (1.6.2) 
and 

Xt412 = (Zia 7 Zi MZ = Х + Mtg + ХХ +. (1.6.3) 


The former equation, (1.6.2), is the easiest one to work with when consider- 
ing the total return over two days; likewise the x, provide straightforward 
results for total returns over more than two days. 

All our definitions of return give nominal results, that is they ignore 
inflation. So-called real returns, which take inflation into account, cannot 
be calculated sensibly for daily series so we follow the accepted convention 
of looking at nominal returns. Nevertheless, the consequences of inflation 
for random walk and efficient market tests are considered later on. 

The returns definitions also ignore trading on margin. This is considered 
unimportant. If a unit of money can be used to finance a purchase of g/z,_, 
items, with gearing g > 1, one day later the investment will be worth 


1+g(z, + d, — 72,_\)/2—у=1 + gx, = 1 + gle*— 1). (1.6.4) 


The return on the geared investment is thus g times the ungeared return, 
exactly for simple returns and approximately so for compound returns. It 
can be argued that no capital is needed to finance trades at US futures 
markets in which case returns definitions like (1.6.1) and (1.6.4) have no 
clear economic interpretation. In these circumstances x, remains an im- 
portant number for modelling prices and testing hypotheses. 

Finally, note that interest paid on currencies bought at a spot market and 
deposited in the banking system does not appear in the definition of x,. 
Adjustments for interest rates are given later, in Section 6.11. 


1.7 MODELS 


Financial prices and hence returns are determined by many political, 
corporate, and individual decisions. A model for prices (or returns) is a 
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detailed description of how successive prices (or returns) are determined. 
We will say that the description contains enough detail to be called a model 
if it can be used to simulate prices. A good model will be capable of 
providing simulated prices which look just like real prices. For such a 
model, if we gave someone along series of real prices (from an un-named 
market) and an equally long series of simulated prices then the person 
could only guess which of the two series was real. Thus agood model must 
describe all the known properties of recorded prices. 

Models have been constructed using concepts from statistics, econ- 
omics, and other sciences, such as psychology. The random walk hypoth- 
esis can be stated as a statistical model of price behaviour. However, the 
efficient market hypothesis is too general a statement about prices to be a 
model. It does not give enough detail for simulations. Instead, the 
hypothesis asserts that only a certain class of models should be considered. 

Models can be conjectured from data or can be suggested by economic 
theory. Our models will be constructed by studying time series of price 
data, to obtain a probabilistic description of price behaviour. The models 
obtained are stochastic processes and a review of some relevant definitions 
follows in Section 1.9. 

Any model will only be an approximation to the rules which convert 
relevant information and numerous beliefs and actions into market prices. 
Some approximations will be more accurate and helpful than others. We 
will seek models satisfying five criteria. 

Firstly, models should be consistent with past prices. Secondly, hypoth- 
eses implied by a model ought to be amenable to rigorous testing so that, 
in principle, the model is falsifiable. A potentially falsifiable model may be 
called a scientific model. These first two criteria cause models for returns 
to be quite complicated if there are plenty of past prices for tests, as will be 
shown from Chapter 2 onwards. A third criterion is that models should 
nevertheless be as simple as possible. Few parameters are preferable to 
many, the so-called principle of parsimony. 

The preceding criteria aim for model accuracy. For a model to be helpful 
in practice the following further criteria are essential. Fourthly, a model 
should provide forecasts of future returns and prices, which are statistically 
optimal assuming the model is correct. It is even better if probability 
distributions for future prices can be calculated. Fifthly, it is obviously 
beneficial if a model can be used to aid rational decision making. A 
satisfactory model will have implications for trading the asset considered 
and for pricing options to trade it at a later date. Simulation of the model 
may well indicate the best ways to take trading decisions. 

To illustrate the application of the various criteria, consider a very simple 
model, the Wiener process. This model states that the return during 7 time 
units follows a normal distribution with mean and variance each propor- 
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tionalto T; also,that returns duringanytwo non-overlapping time intervals 
are independent. The model was used by Black and Scholes (1973) to derive 
their famous and often used option pricing formula. A Wiener model is 
most scientific as it implies, inter alia, normal distributions, constant daily 
variance and independent returns and all these hypotheses can be tested. 
Indeed, some of the hypotheses are always rejected if a few years of prices 
are available. The Wiener model has two parameters and only the variance 
of daily returns needs to be estimated for option pricing, thus the model is 
certainly parsimonious. And, finally, probability distributions are easy to 
obtain for future dates, leading to rational option values. 


1.8 MODELS IN THIS BOOK 


Stochastic processes will be used to model daily returns. General defi- 
nitions and some important examples are given in the remainder of this 
introductory chapter, see Sections 1.9 and 1.10. 

Chapter 2 reviews many statistical characteristics of returns for a com- 
prehensive selection of financial goods. Afterwards, we seek models able 
to match the most important characteristics. It is shown that the search for 
adequate models must take place outside the familiar framework of linear 
processes. The reason for this is changes in either the unconditional or 
conditional variance of returns. 

Chapter 3 presents various models for the time-dependent variance of 
returns. Chapter 4 compares the accuracy of these models by assessing 
appropriate forecasts linked with optimal forecasts of future standard 
deviations. Chapter 5 evaluates the consequences of variance changes for 
random walk tests. It is suggested that returns are rescaled before tests are 
performed. 

Chapter 6 tests the random walk model and special models derived from 
the idea of perfect markets. Several test statistics are considered. In 
particular, statistics are deduced from an alternative class of models 
containing trends in prices. Random behaviour is rejected for all the long 
series tested. Trend models are preferred for commodities and currencies 
but not stocks. Stocks reflect relevant information quickly if not perfectly. 

Chapter 7 describes price-trend models in detail and shows how to 
estimate the model parameters. This completes the modelling of returns 
and the rest of the book covers applications. The chapter continues with a 
description of optimal forecasts for trend models and compares their 
empirical accuracy with random walk forecasts. 

Chapter 8 considers the efficiency of futures markets by examining some 
theoretical and practical implications of trading rules constructed from 
price-trend models. These rules are good enough to create doubts about 
the applicability of the efficient market hypothesis to futures. 
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Chapter 9 discusses the implications of both variance and trend models 
for calculating the values of options. Option values are particularly sensi- 
tive to the variances of returns. Theoretical results demonstrate the import- 
ance of good variance models and satisfactory variance estimates. 

Chapter 10 concludes the main text. It is followed by an appendix 
describing a computer program which readers may use to help model their 
own financial time series. 


1.9 STOCHASTIC PROCESSES 


General remarks 


Let us now suppose that {x,, X2, X3,...,X,} is any observed univariate, time 
series, so that x, is a single number recorded at time t and observations are 
available for n consecutive times. In our examples, twill count the number 
of days a market has been open after the first price is recorded for some 
investigation. Before a particular time t, the value of x, will almost certainly 
not be known. Thus we may consider x, to be the realized value of some 
random variable X,. A sequence 1... Xi, X, Xs, ..., X, ...) of random 
variables is a stochastic process and we may wish to include variables for all 
times on an infinite scale. Stochastic processes will often be denoted by a 
typical variable in curly brackets, e.g. {X,}. Sometimes they are called the 
process generating particular data or they are simply called a process or 
model. 

Time series analysis is primarily the art of specifying the most likely 
stochastic process that could have generated an observed time series. We 
now review the definitions and properties of various stochastic processes. 
These have provided successful models for many physical and economic 
series. The first four chapters of Granger and Newbold (1977) provide a 
more detailed and mathematical description of useful stochastic pro- 
cesses. We will here highlight relevant information for empirical research 
involving financial time series. 


Stationary processes 


Any stochastic process will have parameters, such as the means of the X,. 
Realistic estimation of the parameters will not be possible if they change at 
all frequently as time progresses. The most practical models will be those 
whose parameter values all remain constant. This will happen if any 
multivariate distribution of the X, does not depend on the choice of times. 

A stochastic process {X,} is said to be strictly stationary if for all integers 
i, j and all positive integers k the multivariate distribution function of 


(Xi, Хат, +++) X) is identical to that of (X, X; 4, ---, X44). In practice 
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it is only possible to test some of the consequences of the assumption that 
a process is strictly stationary. Two consequences are particular important. 

First, as X and X, have identical distributions their means are identical, 
thus E[X,] equals some constant д. Second, because the pairs (X, X+) and 
(X, X.) have identical bivariate distributions it follows that the auto- 
covariances 


cov (X,, X.) = EIX — Ш). Ш = А, (1.9.1) 


depend on the time lag т alone. This will be true for all т. In particular the 
X, have constant variance Ag. Note that in this chapter we assume the X, 
have finite means and variances. We comment on infinite variance models 
for returns in Section 2.8. 

A stochastic process whose first and second order moments (means, 
variances, and covariances) do not change with time is said to be second 
order stationary. We will abbreviate this to stationary. 

Appropriate stochastic processes for financial prices are generally 
agreed to be not stationary. Inflation increases the expected values of most 
prices as time progresses. Thus the first moment changes. Deflating prices 
could give constant expected values. However, the variances of prices 
increase with time. This is obvious for a random walk. For if {Z,} is a model 
for prices or their logarithms and if з, = Z, — 4;_1 is uncorrelated with 2, 1 
and has positive variance, then 


var (Z,) = var (Z, 1 + m) = var (Z,;_1) + var (gy) > var (Zi 4) 


so the variances depend on the time t. 

Although the price process is not stationary it is quite possible that 
returns can be modelled by stationary processes. We will provide evidence 
about the stationarity of the returns process on several occasions in later 
chapters. 


Autocorrelation 


The correlation between two random variables X; and X,,, obtained from a 
stationary process is called the autocorrelation at lag r. It will be denoted 
by p,. As X; and X;,, both have variances equal to Ao, 


p, = соу (Х;, X;4,)/Ag = Al Ag. (1.9.2) 


Clearly рр = 1, also p, is defined for all integers тапа p, = p ,. Asp, iS а 
correlation coefficient, —1 x p, x 1. 

If we study a non-stationary process for which cor (X, X,,,) depends оп 
7 alone then we will still use the notation p, to denote the autocorrelation 
at lag т. For example, the X, could have time-dependent autocovariances 
yet possess autocorrelations determined by the time lag between variables. 
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Table1.1 Definitions of certain types of stochastic processes 


Aprocessis... If... 
Strictly stationary The multivariate distribution function for k consecutive 
variables does not depend on the time subscript 
attached to the first variable (any K). 
Stationary Means and variances do not dependon time subscripts, 
covariances dependonly on the difference between 
thetwo subscripts. (Strictly stationary implies 


stationary.) 

Uncorrelated The correlation between variables having different 
subscripts is always zero. 

Autocorrelated Itis not uncorrelated. 

White noise Itis stationary and uncorrelated. 

Strict white noise The random variables are independent and have identical 
distributions. 

Linear Itis a linear combination of the present and past terms 
from astrict white noise process. 

Gaussian All multivariate distributions are multivariate normal. 


Stationary and Gaussian implies linear. 


In particular, we will say that the process generating prices is arandom walk 
if the process generating returns has constant mean and zero autocorrela- 
tions at all positive lags, even if the variances of returns appear to be time 
dependent. 

Any process, stationary or non-stationary, will be called uncorrelated if 
the correlation between X, and X,., is zero for all jand all т> 0. The adjective 
autocorrelated is used if a process is not uncorrelated. Table 1.1 sum- 
marizes the definitions of various types of processes. 

An important property of the autocorrelations of a stationary process ts 
that they are sufficient to obtain the optimal /inear forecasts when optimal 
means least, mean squared error. For example, if 


Fa = et ó+ > a(X_; — д) 
i-0 
is a linear forecast of X,,,, with дапа the a; (i = 0) being constants, then it 
is easy to show that E[(X,,, — F,,)2] equals 8? plus a function of the numbers 
a; and p,. Thus the optimal, linear forecast will be unbiased (ó = 0) with the 
best a; depending only on the sequence p,, 7 > 0. 


Spectral density 


The autocorrelations p, and the variance Ay conveniently summarize the 
second order moments of a stationary process. We will frequently use 
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these quantities. Another representation of the second order moments is 
the spectral density function, which can be defined by 


slw) = Ао + 1 У A, cos (то) (0 < о < 27). (1.9.3) 
2п т 1 


The integral of (о) over the interval from 0 to 27 is А. High values of s(w) 
might indicate cyclical behaviour at frequency о with the period of one 
cycle equalling 27/w time units. The frequency-domain function s(w) is 
more difficult to estimate and comprehend than the time-domain se- 
quence A,. Consequently, we will concentrate on time-domain methods. 
Spectral tests of the random walk hypothesis can nevertheless be import- 
ant. They will be discussed and compared with autocorrelation tests in 
Chapter 6. 


White noise 


The simplest possible autocorrelations occur when a stationary process 
consists of uncorrelated random variables so that 


Pp = 1 and р, = 0 for all + > 0. 


The optimal linear forecast of X,,, will then Бе its mean и, for any 
realization of the present and past variables (X, ;, j 2 0). A stationary and 
uncorrelated process is called white noise, because its spectral density 
function equals a constant value for all frequencies o. When, furthermore, 
the process has E[X,] = 0 it is referred to by the name zero-mean white 
noise. 

It is important when modelling financial returns to appreciate that if (X, ) 
is white noise then X, and X, are not necessarily independent for i # j. Zero 
autocorrelation does not ensure that the distribution of X is independent 
of a realized value x,, even when the X, have identical, unconditional 
distributions. Examples are presented in Chapter 3. If, however, the X, are 
independent and stationary then the stochastic process is called strict 
white noise. 

The random walk hypothesis implies zero autocorrelation among 
returns. Various definitions of the hypothesis have been offered including: 


(a) The returns process is strict white noise, 
(b) The returns process is white noise, 
(c) Returns are uncorrelated and their process is not necessarily stationary. 


We will later see that it is easy to test and refute (a) but it is difficult to test 
(b) and (c). 
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ARMA processes 


A zero-mean white noise process (e) can be used to construct new 
processes. We will describe three examples, used to model various returns 
or derived series in later chapters. Afterwards we present the general 
autoregressive-moving average (ARMA) model. 

First, consider a process (X,) defined by 


X — w= alX 7 д) + E, (1.9.4) 


so that X, depends linearly on X, | and the innovation e, alone. The process 
{X,} is called an autoregressive process of order 1, abbreviated to AR(1). It 
is stationary if, as we will always assume, the autoregressive parameter a 
satisfies the inequality la} < 1. Since F[X,] = ЕХ, 1] fora stationary process 
and Ele] = 0, it follows immediately that the mean parameter is simply 
и = E[X,]. The remaining parameter of an AR(1) process is its variance, 
Ay = var (X) = var (&/(1 — a’). 

Convenient results can often be obtained by using the backshift operator 
B, defined by Ba, = a,_, for any infinite sequence of variables or numbers 
{a}. In particular, B^X, = X, хапа B“, = u forall positive integers k. Thus 
(1.9.4) can be rewritten as 


(1 — aB)(X, — ш) = е. (1.9.5) 


Since |a| < 1, we may correctly write 


1 o0 
= By! 
1 — aB p sab) 
and therefore 
1 © : - . 
A cU ma cupo 2 Et (1.9.6) 


Thus X, is also an infinite-order weighted average of the present and past 
innovations. 

We can now deduce the autocorrelations of the AR(1) process. Multiply- 
ing both sides of equation (1.9.4) by X,_, — wand taking expectations gives 


А, = aÀ,., + Ele (X... — ш). (1.9.7) 


Ву (1.9.6), £(X,., — д) = > a’e,e,_,_;. As { є, } is white noise any term eje. ,., 
has zero expectation if 7 + i > 0. Thus (1.9.7) simplifies to 


A, = aÀ,.4 (all 7 > 0) 


and consequently A, = а'А„. An AR(1) process therefore has autocorrela- 
tions p, = a’. 
Second, consider the process {X,} defined by 
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X, = и  & + b& 4 (1.9.8) 


so now X, is a linear function of the present and immediately preceding 
innovations. This process is called a moving average process of order 1, 
summarized by MA(1). It will always be stationary with mean ш and if 
|b| < 1, as we will always assume, optimal forecasts can be calculated. The 
variances of X, and &, denoted by A, and o? respectively, are from (1.9.8) 
related to each other by Ay = (1 + b?)a?. The autocovariances are simply 


À, = cov (X, Xir) == E[(e, T be AY (6. + beg,,,-1)] 


which will be zero whenever r > 1, whilst A, = ba?. Consequently the 
autocorrelations of an MA(1) process are 


b 


m p, = Oforall 7 > 2. 


Py 
The jump to zero autocorrelation at lag 2 may be contrasted with the 
smooth, geometric, decay of AR(1) autocorrelations. 
Third, consider the natural combination of the AR(1) and MA(1) models, 


X, — и = a(X, + ш) + e + be, 1. (1.9.9) 


This mixed model is an autoregressive-moving average process, denoted 
ARMA(,1). It will be used extensively to model various sertes derived from 
returns. We only consider the models for which la| < 1 and thus {X} is 
stationary and |b| « 1 so that, as soon shown, optimal linear forecasts can 
be obtained. Once more the mean is ш. Using the backshift operator, the 
model can be represented as 


(1 — aB)(X, — ш) = (1 + БВ)є,. 


A pure moving-average model is given by 


1+ БВ I ADIT 
X-mas ey «- (5 ав + БВ), 


i-0 


lI 


& + (a + b) > аб... (1.9.10) 


і-1 
Likewise, a pure autoregressive model can be obtained, 
1- aB 
1+ bB 


(X — ш) = (1 — [x (—bB) |ж = py =, 


i-0 


and simplifying gives 


X — w= (a + b) X (-b) (X ; u) + &. (1.9.11) 


ігі 
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From (1.9.10) it is obvious that any product (Х,_, — u)&.; has zero 
expectation if 7 > j. This means that if we multiply both sides of (1.9.9) by 
X- — м and take expectations, A, = aA, for T = 2 whilst т = 1and 7 = 0 
respectively give 


A, = aÀo + bo? and (1.9.12) 
Ào = aA, + (1 + ab + b2)o2, 


yet again making use of (1.9.10) with а? = var (&). Eliminating о? from the 
preceding equations, the autocorrelations of an ARMA(1,1) process are 
found to be 

р, = (ада t al r>1. (1.9.13) 
These autocorrelations decay at the rate р, ,/p, = а (т > 1) like those of an 
AR(1) process, but unlike an AR(1) the ARMA(1,1) process has р; # a if 
b = 0. We will only be interested in models for which a is positive, b is 
negative and а > |b|. Then p, сап be much less than a if a + b is a small 
number. 

The optimal linear forecast of X,,, as a function of random variables 
realized at time t or earlier can be deduced from equation (1.9.11). As &,,, 
is uncorrelated with every X, and е,,5 < t, the mean square error of a linear 
forecast must Бе at least the variance of &,,. This minimum value is 
attainable by substituting zero for the innovation £4, in an equation 
defining X,,,. By replacing every t in equation (1.9.11) by t + 1, it is 
deduced that the optimal linear forecast of X,+ is 


Fa = p + (a + b) Y (b) uua - и). (1.9.14) 
i=1 


It isthen easy to obtain the more convenient formula 
Fa =p + (a + bX — ш) — bti — и). (1.9.15) 


This formula will be used on anumber of occasions with estimates 2, â, and 
b replacing the parameters u, a, and b, and a realized value x, replacing X, 
to give a realized forecast. 

To forecast further into the future consider the following equation given 
by repeatedly using the definition (1.9.9): 


N 
Хам = ыт аў х= м) + > Gets, N > 1, 
= 


with each c; determined by a, Б, and N. As the best linear forecast of е,;; 
(i > 0) using variables X,_, (j = 0) is simply zero, itis possible to deduce the 
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optimal linear forecast of X,,4 made at time t. Denoting this optimal 
forecast by F, yit is 


Fn = м + a NEVER (1.9.16) 


The optimal linear forecasts for AR(1) and MA(1) processes can be deduced 
from (1.9.15) and (1.9.16) by substituting b = 0 and a = 0 respectively. 

More general ARMA processes are defined by using p autoregressive 
and q moving-average parameters: 


p q 
Х-и = aX и) + У bis. (1.9.17) 
1=1 j-0 

with b, = 1,a, = O and b, z 0. This defines an ARMA(p, q) process. It will 
be stationary if all the solutions of a,z + az? + ++ + az? = 1are outside 
the unit circle, |z| = 1, z being a complex number. The process is said to be 
invertible if optimal linear forecasts can be obtained, which requires all 
solutions of 1 + biz + --- + baz? = 0 to be outside the unit circle. 

Box and Jenkins (1976) describe an effective strategy for selecting an 
appropriate ARMA model, first published in 1970. Since then these models 
have been used to explain many economic and financial time series, see for 
example, Granger and Newbold (1977). Most models fitted to real series 
have p + q < 2asin the three examples we have discussed. 


Gaussian processes 


The random variables X, defining a stationary process can have any proba- 
bility distribution. A stationary process is called Gaussian if the joint 
distribution of (X;,4, X27 ..., Xek) is multivariate normal for every posi- 
tive integer k. A stationary Gaussian process will be strictly stationary 
because the first and second order moments completely specify the joint 
distributions. Also, Gaussian white noise is strict white noise since uncor- 
related is equivalent to independent for bivariate normal distributions. 

Although returns are certainly not generated by a stationary Gaussian 
process we will show in Chapter3 that interesting models for returns can 
be constructed using Gaussian processes. 


1.10 LINEAR STOCHASTIC PROCESSES 


Their definition 


Any stationary ARMA process is also a moving average of white noise 
innovations (£j, perhaps of infinite order. Equation (1.9.6) gives an 
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example and shows that an AR(1) process can be represented by an MA(«) 
process. In general, a stationary process may have 


-u= > bg. (1.10.1) 
j=0 


for some zero-mean white noise process {¢,} and constants b,. The 
innovations will be uncorrelated but may not be independent. There will 
be a representation (1.10.1) for a stationary process if it does not contain а 
deterministic component (Granger and Newbold, 1977, p. 36). 

Equation (1.10.1) states that X, is a linear function of the e, ;. A stationary 
process (X,) is said to be linear if it can be described by (1.10.1) with 
independent innovations, i.e. (&) is strict white noise. It is certainly 
possible that a stationary process is not linear and has no deterministic 
components. In fact, a white noise process need not be linear, as will be 
shown in Chapter 3. Stationary Gaussian processes are always linear. 

The distinctions between white and strict white noise and between a 
linear and non-linear stationary process are extremely important, yet they 
have often been neglected in finance research. We will show in Section 
2.10 that non-linear models are required for daily returns so their innova- 
tions cannot be strictly white although they might be white noise. Later on, 
in Chapter 5, we will quantify some consequences of using random walk 
tests upon non-linear processes when the tests are only valid for linear 
processes. 


Autocorrelation tests 


A time series of n observations, { х1, х, ..., Xa}, сап be used to estimate 
the autocorrelations p, of the process generating the observations, assum- 
ing that the process is stationary. Let x be the sample average, > x,/n. Then 
we estimate the p, by the sample autocorrelations 


Y x XX+ — X) 
ps бу. = Я r> 1. (1.10.2) 
> (хх 


t= 


- 


We сап consider r, їо Бе the realized value of а random variable 
К, = > (X, — Х(Х,,- X)/> (X, — Х)?, using the same summation limits as 
n (1.10.2). The estimates r, are often used to test hypotheses about the 
theoretical autocorrelations р,. Afterwards an appropriate stochastic 
model may be deduced. We need to know the distributions of the variables 
R, to perform hypothesis tests. 

Detailed results about the distributions of the R, are available for large 
samples from /inear processes. Comparable results are not known for 
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non-linear processes. Anderson and Walker (1964) proved a very general 
theorem about the multivariate distribution of (R,, №, ..., Ку) for linear 
processes. It may be stated as follows. Consider a process (X,) defined by 


X, — w= > Б-у 
j=0 


having innovations e, independently and identically distributed with finite 
variance and also X |b;| and > jb? finite (summing over j > 0). Then the 
asymptotic distribution, as n > х, of 


Vn(R, — p, R, — px, ..., Re — pk) 


is multivariate normal with all means zero and covariance matrix W, 
determined by the complete sequence p,, 7 > 0. All useful, finite variance, 
linear processes satisfy the conditions of the theorem. 

Practical approximations can be obtained for long time series. In particu- 
lar, for independent and identically distributed X, (strict white noise), 
possessing finite variance, W, issimply the k x kidentity matrix. Therefore, 
for large n, 


R, — N(0, 1/n) approximately, (1.10.3) 


and furthermore R, апа К, are approximately independent for all í >< j. The 
notation М(и, a?) indicates the normal distribution whose mean and 
variance are u and o2, respectively. 

These conclusions are well known. However, the assumptions required 
to make these conclusions correct are often misquoted. Two misunder- 
standings are common. Firstly, the standard result (1.10.3) is generally false 
for a non-linear, uncorrelated process, even if it is finite variance, white 
noise. Such processes can have var (R,) far greater than 1/n. Thus if sample 
autocorrelations are significantly different from zero, using (1.10.3), then 
we cannot correctly reject the hypothesis that the X, are uncorrelated. All 
we could say is that the X, are not independent and identically distributed. 
Secondly, the X, do not need to have normal distributions for approxima- 
tion (1.10.3) to be reasonable. The essential distributional requirement is 
merely finite variance. 


Chapter 2__ 


Features of Financial Returns 


The most important characteristics of daily returns are discussed in this 
chapter, before developing models and testing hypotheses in later chap- 
ters. Empirical results are gleaned from many sources and compared with 
results for forty time series filed at the University of Lancaster. These series 
are described in Section 2.2. Sources of prices and guidelines for construct- 
ing a financial time series are given in Section 2.1. From Section 2.3 
onwards, the means, variances, distributions, and autocorrelations of daily 
returns are summarized. The chapter ends with a demonstration that 
non-linear stochastic processes are needed to model returns. 


2.1 CONSTRUCTING FINANCIAL TIME SERIES 


Sources 


Prices can be obtained from university research centres, commercial 
organizations, some financial markets, newspapers, and other periodicals. 
The most comprehensive database of US stock prices and returns is 
maintained by the Center for Research in Security Prices, located within 
the University of Chicago. Daily returns on every common stock listed on 
the New York and American stock exchanges, beginning in 1962, can be 
bought for a few thousand dollars. There is no comparable source of daily 
European stock prices. Share price series available from commercial 
organizations are usually not long enough for intensive research. Futures 
prices are filed in another university database, supervised by the Center for 
the Study of Futures Markets at Columbia University, New York. Prices are 
kept for all important US, Canadian, and UK markets, in some cases 
recorded before 1970. 

Collecting your own prices can be very time consuming. The annual 
handbooks published by the Chicago Board of Trade and the Chicago 
Mercantile Exchange are cheap and provide daily closing prices and other 
information. Several daily newspapers publish prices, for example the Wall 


26 


Features of financial returns 27 


Street Journal, the New York Times, the International Herald Tribune, and 
the Financial Times. 


Time scales 


Price series have been studied for time scales ranging from every trans- 
action, through daily, weekly, and monthly prices to annual prices. This 
book is exclusively about the analysis of daily prices, usually recorded at 
the close of a day's trading. One price per day is nearly always the most 
appropriate frequency in my opinion. The exceptions could be so-called 
thin markets where there are often no trades during a day; one price per 
week may then be suitable. Studying the prices at which every transaction 
was made produces vast amounts of data after a few months. A few 
examples of transaction price studies are Niederhoffer and Osborne 
(1966), Oldfield et al. (1977), and Martell and Helms (1979). A series of daily 
prices or returns contains useful information missing in a series of weekly 
or monthly prices. The extra information improves variance estimates, 
increases the power of random walk tests, and is essential for investigations 
of trading rules. 

The duration of calendar time covered by a time series should be as long 
as possible. In my experience, at least four years of daily prices (i.e. 1000 
observations) are required to obtain interesting results and if possible 
eight years of prices (2000 observations) or more should be investigated. 
The relationship between series size and the accuracy of test results is 
explored in Section 6.10. Over several years the best descriptive model 
may of course change. All the same itis better to have extra data and to seek 
solutions to any problems caused by changes in the process generating 
prices. 


Additional information 


As well as obtaining closing prices it can be helpful to collect daily opening, 
high and low prices, and daily trading volume. The additional prices can be 
used to improve variance estimates (Parkinson, 1980; Garman and Klass, 
1980); however, this interesting idea will not be pursued here. Trading 
volume will indicate when a market is thin. Many instances of zero volume 
would suggest that published prices do not necessarily describe the prices 
at which transactions could have been agreed. Newspapers sometimes 
publish the closing bid and ask prices. When the bid-ask spread is large a 
thin market is again indicated. Actively traded goods have very small 
spreads and the average of the bid and ask prices can be used to define the 
closing price. 

A low bid-ask spread does not ensure any trades could take place at or 
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very near the closing price. A steady flow of transactions towards the end 
of the day is required to make the closing price a feasible transaction price. 
Thin trading is rarely a problem for the stocks of large companies and 
carefully selected futures contracts. 

Prices recorded at different times must not be averaged as this creates 
correlation between consecutive changes in the averages. A week-long 
average of daily prices is no good (Working, 1960). Neitheris the average of 
the day's high and low prices or the day's high acceptable: both pos- 
sibilities lead to spurious correlation (Daniels, 1966, corrections in Rosen- 
berg, 1970). 


Using futures contracts 


Long series of futures prices can only be constructed if several contracts 
are used. This causes few problems if the derived time series contain one 
return per trading day and each return is calculated using two prices for the 
same contract. A contract should contribute prices for a period when it is 
not thinly traded. Sellers of futures deliver goods to buyers on some or all 
days of the delivery month named in the contract. It is necessary for some 
futures and always advised to keep returns during the delivery month out 
of a constructed time series. Detailed examples follow in the next section. 


2.2 PRICES STUDIED 


To illustrate the statistical methods recommended later and to obtain new 
conclusions about price behaviour, we consider 23 spot series and 17 
futures series. All of them provide series of daily returns. Prices and returns 
have been given to me by many generous people; their names can be 
found in the Preface. Some prices have been copied out of newspapers and 
commodity exchange handbooks. 

Long series of prices can rarely be perfectly accurate. Series have been 
checked for errors, when possible, by comparing the prices causing the 
largest returns against alternative sources. The number of errors remaining 
is almost certainly so low that the major results are not affected. 


Spot prices 


Prices for 15 US stocks, the Financial Times 30-share index, 6 metals, and 
the dollar/sterling exchange rate are studied. Table 2.1 summarizes the 
series. Тһе US stock series are the data previously analysed by Giaccotto 
(1978) and Ali and Giaccotto (1982). There are 2750 returns for each of the 
first fifteen stocks in the Dow Jones Industrial Average, covering the period 
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Table 2.1 Description of spot series 


Dates 
Number of 
Series Market From To daily returns 


Stocks 

Allied New York Jan. 1966 Dec. 1976 2750 
Alcoa 

American Can 
ATandT 

American Brands 
Anaconda 
Bethlehem 
Chrysler 

Dupont 

Kodak 

General Electric 
General Foods 
General Motors 
General Telephone 
Harvester 


Stock index 
Financial Times 30 London July 1975 Aug. 1982 1794 


Metals 

Gold London Apr. 1975 Dec. 1982 1957 
Silver Jan. 1970 May 1974 1098 
Copper Apr. 1966 Dec. 1981 3970 
Lead Jan. 1970 Dec. 1981 3025 
Tin Jan. 1970 Dec. 1981 3025 
Zinc Jan. 1970 Dec. 1981 3025 


Currency 
£/$ International Nov. 1974 Sep. 1982 1997 


from January 1966 to December 1976. These returns are based on prices in 
the Daily Stock Record published by Standard and Poor's Corporation. 

The index series is the geometric average of the prices of thirty leading 
UK shares between 1975 and 1982. Dividends are ignored in theindex. Price 
series for gold, silver, copper, lead, tin, and zinc at the London bullion 
market and metal exchange are defined in various ways, using closing 
(gold), fixing (silver), and settlement prices (metals). These series range in 
length from 1099 silver prices to 3971 copper prices between 1966 and 1981. 
The gold prices are always in dollar units whilst all the other UK data are in 
sterling units. 

Currency prices are published by the International Monetary Fund in its 
monthly /nternational Financial Statistics. The dollar/sterling rate has been 
looked at from November 1974 to September 1982. 
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Futures prices 


Thirteen agricultural and four financial futures series are investigated, 
summarized by Table 2.2. Each of the goods is simultaneously traded for 
several delivery months. Thus there are several futures prices to choose 
from оп any given day. І have selected certain months of the year and then 
used the first suitable price for the selected months. 

For example, the currency prices are taken from June and December 
contracts. A June contract is used from December until the end of the 
following May, then a December contract from June until the end of 
November, as illustrated below. 


Months Contract used 


December 1975 to May 1976 June 1976 
June 1976 to November 1976 December 1976 
December 1976 to May 1977 June 1977, etc. 


Thus each of these contracts contributes six months of prices, ending with 
the final trading day of the month which precedes the delivery month. 
When the return on, say, the first day of June 1976 is calculated, the closing 
prices of December 1976 futures on that day and the last day of May 1976 
are used. This method provides one return for each day of trading. 

With the exception of the short Treasury Bond series, all the futures 
series are constructed from several blocks of prices as in the preceding 
example. Every block ends on the final day before the delivery month. The 
average amount of time covered by a particular contract is 3, 6, or 12 
months, depending on the goods traded and the dataavailable. Futures are 
generally traded very frequently during their final half-years but trading 
can be rather thin more than six months before delivery. Three months 
data per contract is not really enough for effective autocorrelation analysis. 
The ideal period for collecting prices from a contract is thus about six 
months for inclusion in a time series. It is helpful to have a further month 
or so of prices onacontract before its block begins. These additional prices 
can provide useful initial values (the firstexample appears in Section 3.10). 


Commodity futures 


Three series are studied for every one of corn traded at Chicago and cocoa, 
coffee, and sugar traded at London. The three series are given by taking 
either 3, 6, or 12 months of prices (on average) from various contracts, 
listed in Table 2.2. Each 12 month series uses a delivery month shortly after 
the main annual crop ís harvested. Comparisons between the 3, 6, and 12 
month series allow us to see how, if at all, the price generating process 
changes during the lifetime of a contract. 
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Table2.2 Description of futures series 


Dates Number 
ofdaily 
Series Market Monthsused From To returns 
Commodities 
Corn Chicago(CBOT) D Jan.1963 Dec. 1976 3195 
Chicago (CBOT) Mr,S Jan. 1963 Dec. 1976 3510 
Chicago (CBOT) Mr,My,S,D  Jan.1963  Dec.1976 3510 
Cocoa London S Jan. 1971 Aug. 1981 2684 
London Mr,S Jan.1971 Aug. 1980 2441 
London Mr,My,S,D  Jan.1971  Aug.1980 2441 
Coffee London N Jan.1971 Осі. 1981 2728 
London My, N Jan.1971 Oct. 1980 2486 
London Mr,My,S,N  Jan.1971 Oct. 1980 2486 
Sugar London D, Ja? Jan.1961 Dec. 1981 5302 
London Mr, O Jan.1961 Sep.1979 4741 
London Mr,My,A,D Jan.1961 Nov. 1979 4786 
Wool Sydney D Nov.1966 Nov. 1978 3270 
Financial 
Sterling Chicago (CME) Jn,D Jan. 1974 Nov. 1981 1990 
Deutschmark Chicago(CME) Jn,D Jan. 1974 Nov. 1981 1990 
Swiss franc Chicago (СМЕ) Jn,D Jan.1974 — Nov. 1981 1990 
T-Bond Chicago(CBOT) S^ July1981 July 1983 518 


Months: Ja = January, Mr = March, My = May, Jn = June, A = August, S = September, 
O = October, N = November, D = December. 

?December contracts until 30 November 1979, then January contracts. 

^ All prices are for the September 1983 contract. 


There are more than 3000 prices in the corn series, covering the period 
from 1963 to 1976. Previously, Guimaraes (1981) studied these prices from 
the perspective of a Portuguese corn importer. The sugar prices begin in 
1961 and the 12 month series ends in 1981 to give 5303 prices, the longest 
series investigated here. Cocoa and coffee prices cover a decade beginning 
in 1971. 

Wool futures have been traded in Sydney ror a long time. Prices from 
1966 to 1978 are studied and the first half of the series has been analysed by 
Praetz (1975). 


Financial futures 


Sterling, the deutschmark and the Swiss franc are the European currencies 
traded (at present) in large quantities by futures contracts at the Inter- 
national Monetary Market (IMM) in Chicago. These three currencies are 
studied. Daily settlement prices have been transcribed from IMM year- 
books for January 1974 to November 1981. As these prices are averages over 
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the final minute of trading, they are effectively closing prices. There are 
almost 2000 prices in each series. The IMM publishes daily trading volumes 
by contract. Volumes are relatively low when the contract has more than six 
months to run, so six months per contract are used. Itis possible to identify 
all the days when there was no trading in a contract. For sterling these 
occurred for 12 per cent of the market days before June 1976 and the 
percentages are smaller for the other series. After June 1976, the per- 
centages decreased rapidly as the trading volume accelerated upwards. 
When the volume was zero and the IMM announced a settlement price it 
has been used; otherwise the preceding day's price has been used. 

А short Treasury Bond series records the closing prices of the September 
1983 contract (at the Chicago Board of Trade) for two yearsbeginningin July 
1981. There was some business in this future on every day and frequently 
thousands of contracts were traded in a day. 


Extended series 


Articles have been published about some of the prices studied here, since 
1977. Overthe years, prices have been added to many series to make them 
more up to date. Results are presented for the complete series throughout 
this book. Only very occasionally are the conclusions different to those 
published hitherto. Such differences are caused bythe extra prices improv- 
ing the power of tests to decide between rival hypotheses. 


2.3 AVERAGE RETURNS AND RISK PREMIA 


The distribution of observed returns хі, Xz, ..., x, can be summarized by 
their average value (x), standard deviation (s), skewness (b) and kurtosis 
(К). These statistics are defined by: 


n n 
> x, 5 = > (x, — x), 
t=1 


Х- 
(2.3.1) 


У (x, — XIS, k 


The statistics estimate the respective parameters of the process generating 
returns, if the process is strictly stationary. When the process is not 
stationary the statistics can still be interesting; for example, x then esti- 
mates the average expected return. 

Tables 2.3 and 2.4 present the statistics for the time series recently 
described. We will discuss each statistic separately, commencing with the 
average return. 


Features of financial returns 33 


Table 2.3 Summary statistics for spot series 


Series Years n 10*x 1075 b k G% A% 
Stocks 
Allied 1966-76 2750 1.09 1.85 0.30 6.05 2.8 7.3 
Alcoa 1.59 1.76 —0.30 7.40 4.1 8.2 
Am. Can 1.00 1.29 0.15 5.70 2.6 4.7 
ATandT 2.24 1.01 0.29 6.44 5.8 7.2 
Am. Brands 287 1.34 034 6.17 7.5 10.0 
Anaconda 045 2.35 0.71 11.25 1.2 8.4 
Bethlehem 2.29 1.71 0.31 5.47 5.9 9.9 
Chrysler -2.09 2.44 0.40 6.96 -5.1 2.3 
Dupont —0.71 1.34 0.22 4.76 -1.8 0.5 
Kodak 203 1.57 009 6.35 5.2 8.6 
G. Electric 095 1.54 0.05 5.25 2.4 5.5 
G.Foods 150 1.52 037 5.67 1.3 4.3 
G.Motors 1.32 1.39 0.22 7.36 3.4 5.9 
G.Telephone 147 1.70 013 4.97 3.8 7.6 
Harvester 1.04 1.65 0.05 5.81 27 6.2 
Stock index 
FT30 1975-82 1794 3.26 1.29 0.01 3.51 86 10.9 
Metals 
Gold 1975-82 1957 4.63 1.85 —014 1142 124 17.3 
Silver 1970-74 1098 10.05 1.89  Á—0.16 11.50 288 347 
Соррег 1966-81 3970 098 1.85 -0.06 10.09 2.5 7.0 
Lead 1970-81 3025 3.22 2.02 -0.52 9.76 8.5 14.2 
Tin 1970-81 3025 5.47 128  -0.18 10.13 148 172 
Zinc 1970-81 3025 4.26 2.04 -0.43 15.15 113 17.3 
Currency 
£/$ 1974-82: 1997  —1.61 0.50  —0.56 8.36 -4.0 -3.5 
Notes 


n is the number of returns. 

X, s, b, and k are defined by equation (2.3.1). 

G% = 100 {exp (252Х) - 1) and A% = 100 {exp (252¥ + 12652) — 1}. 
Annual expected returns 


Averages x are very small and they can be easier to interpret if we consider 
returns over one year. Suppose the relevant market trades for N days every 
year. Then $1 invested at time 0 would have been worth 


$z,/zo = $ exp (x, +--+ x,) = $ exp (nx) 


n days or n/N years later, ignoring any transaction costs. The historic annual 
compound rate G giving the same return after n/N years is defined by 


(14 G)"N=e™ i.e. G = exp (NX) — 1. 
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Table 2.4 Summary statistics for futures series 


Series(months) Years n 10*x 102s b k G% A% 
Corn (12) 1963-76 3195 2.12 1.30 0.01 8.29 5.5 7.8 
Corn (6) 1963-76 3510 1.06 1.22 0.18 6.65 2 4.7 
Corn (3) 1963-76 3510 1.00 1.22 0.17 6.57 2.6 4.5 
Cocoa (12) 1971-81 2684 7.42 1.89 —0.15 10.09 20.6 26.1 
Cocoa (6) 1971-80 2441 8.11 1.95 —0.28 9.09 22.7 28.7 
Cocoa (3) 1971-80 2441 9.57 2.03 —0.30 9.19 27.3 34.1 
Coffee (12) 1971-81 2728 5.52 2.26 1.42 26.99 14.9 22.6 
Coffee (6) 1971-80 2486 4.83 2.13 0.62 17.90 12.9 19.6 
Coffee (3) 1971-80 2486 4.79 2.18 0.77 22.02 12.8 19.8 
Sugar (12) 1961-81 5302 2.43 2.34 0.00 8.34 6.3 13.9 
Sugar (6) 1961-79 4741 1.86 2.24 0.02 8.61 4.8 11.6 
Sugar (3) 1961-79 4786 2.10 2.29 —0.01 9.10 5.4 12.6 
Wool (12) 1966-78 3270 1.17 1.32 0.27 23.25 3.0 5.3 
Sterling (6) 1974-81 1990 0.53 0.60 —0.33 5.99 1.3 1.8 


Deutschmark 1974-81 1990 -0.21 0.60 0.01 503 -0.55 —0.1 
(6) 

Swissfranc(6) 1974-81 1990 1.16 0.73 0.44 4.38 3.0 3.7 

T-bond 1981-83 518 1.76 1.01 030 3.96 4.5 5.9 


Notes 

Corn (12) is the corn series based on 12 months of prices per contract, corn (6) uses 6 months 
data per contract, etc. 

For n returns, x, s, b, and К are defined by equation (2.3.1) and G, A are detined by the 
formulae at the end of Table 2.3. 


Values of Gare presented in Tables2.3 and 2.4, assuming there are 252 = № 
trading days per year. 

Investors are often interested in the annual expected return during a 
future year, say from times n to n + N. The simple annual return is 
uncertain and equals 


N 
R= (Zien Ж 2,//2, = exp > X] = 1. 
h=1 


Some researchers estimate E[R] using historic simple annual returns. If 
T =n/Nis an integer then the obvious estimate of E[R] is 


T-1 
r= (WT) > (nya — 24/2 = (/T) > r say. (2.3.2) 
/=0 j 
The historic compound rate G is, however, a downwards biased estimate 
of E[R]. This can be seen for integral T by noting that1 + Gis the geometric 
mean of Tterms 1 + bs hence G is less than the arithmetic mean r which is 
unbiased. 


Now, if the prices are a random walk and the returns process is station- 
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ary, with mean и and variance о”, X X,,, is approximately normal with 
mean Ny and variance No? and so 


EIR] = exp (Ми + 2Na?) — 1. 


This suggests estimating E[R] by A = exp (Nx + 3Ns°) — 1. It must be 
noted that À could be seriously biased if the returns are autocorrelated, 
since then var (E X,,4,) > Мо”. Values of А are given in Tables 2.3 and 2.4 

Cheng (1984) discusses further ways to estimate the annual expected 
return E[R]. The methods are particularly relevant when there are only a 
few years of data. 


Common stocks and ordinary shares 


Financial theory states that investors in stock markets expect a return better 
than that offered on riskless investments like government debt. This is 
certainly observed over very long periods of time and we may safely assert 
that expected daily returns, F[X,], are positive. 

Let F[R] be the annual expected simple return for a particular stock, and 
let А; be the known annual return from riskless investment. Then E[R] — Ку, 
the expected excess return, ought to depend on the risks investors accept. 
A very important model for expected returns, the capital-asset-pricing 
model, relates the risk premium for a particular stock (E[R] — Rp) to that of 
the market portfolio (say E[R,,] — К), thus: 


EIR] — R; = B(EIR,] — К,). 


The risk measure 8 equals cov (R, R,,)/var (Rm) and, over all stocks in the 
market, has weighted average equal to one. This model is described in all 
respectable finance textbooks, for example Fama (1976) and Brealey and 
Myers (1984). 

The market premium, E[R,,] — R;, is usually estimated from historic 
annual or monthly returns. Ibbotson and Sinquefield have published a 
series of books and articles about various annual returns. These contain an 
estimate of the annual market premium for the New York Stock Exchange 
equal to 9 per cent, based on returns from 1926 to 1980 (Ibbotson et al., 
1982). The compound rates G were 9.4 per cent for common stocks and 2.8 
per cent for US Treasury Bills so the premium in geometric terms was 6.6 
per cent. Merton (1980) considers the relevant possibility that the premium 
may change as the level of market risk changes. His estimates of the New 
York Stock Exchange market premium range from 8 per cent to 12 per cent. 
Dimson and Brealey (1978) estimated the UK market premium for equities 
to be 9 per cent between 1919 and 1977. Thus US and UK premia appear to 
be similar. 


Estimates of average returns are necessarily inaccurate because of the 
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volatility of prices, a conclusion emphasized by Merton (1980). The stan- 
dard errors of the annual premia for stocks are about 3 per cent despite 
having more than fifty years of data. To further illustrate the problem of 
inaccurate estimates, suppose daily returns аге uncorrelated with u = 6.17 
x 10 and ø = 0.015 giving an annual expected return E[R] = 0.2 over 250 
days. Consider a ten year sample of daily returns. The 95 per cent prob- 
ability interval for 250x is then 0.154 + 0.147, so the 95 per cent interval for 
the compound rate С = exp (250x) — 1 is from 1 to 35 per cent. 

Our US stock series have low average returns and investors would have 
obtained better results from Treasury Bills. This observation further 
emphasizes the poor accuracy of estimated average returns. 


Spot commodities 


Raw materials prices increase over long periods of time because of in- 
flation. Storing commodities is costly. Thus daily returns ought to have 
positive means and our sample means are indeed positive. 


Spot currencies 


Movements in spot exchange rates must reflect interest rates. If dollar 
interest rates stay higher than sterling interest rates for several months 
then the number of dollars required to buy one pound sterling will be 
expected to increase. Otherwise, UK investors could borrow sterling at the 
domestic rate, buy dollars and then invest them at a higher rate abroad. 
Levich (1979) has investigated similar strategies based on spot and forward 
exchange rates. He concluded that consistent profits cannot be obtained. 


Commodity futures 


The prices of futures reflect expectations about the spot price on a 
particular delivery day. This day does not change as time passes by. Thus 
futures prices are not affected systematically by inflation. Futures have no 
storage costs. It can thus be argued that the expected return from futures 
should be zero. Furthermore, brokers only require a small margin payment 
and the remaining capital invested in futures can be deposited in an 
account paying interest. And US exchanges will accept Treasury Bills as 
margin, so a speculator can sometimes keep the interest on the Bills. 

But if the average return is zero why do speculators take risks by trading 
futures? Keynes and other economists have argued that producers of 
agricultural commodities are net sellers of futures, hence speculators are 
net buyers and should be rewarded for accepting risks by a positive 
premium. Against this, it has been argued that speculators are merely 
gamblers. Dusak (1973), Black (1976), and Chang (1985) summarize the 


Features of financial returns 37 


theoretical arguments. The papers edited by Peck (1977) include contribu- 
tions to the premium debate by Cootner, Gray, and Telser. Kamara (1984) 
surveys several studies of risk premia. 

Like many authors we identify the premium with the expected return. 
We will say there is a positive risk premium if E[X,] > 0. We do not subtract 
risk-free interest rates from expected returns when defining the pre- 
mium—this is done for stocks because capital is invested whereas capital 
does not need to be invested in futures. Hedgers can be net buyers of 
futures making speculators net sellers. Thus our definition of the risk 
premium is rather casual. Chang (1985) considers returns to speculators 
allowing for both net buying and net selling. He asserts speculators have 
been rewarded for taking risks in the corn, wheat, and soybeans markets. 

A statistical test for a premium could compare the t-statistic t = үл x/s 
with the standard normal distribution and reject the hypothesis и = 0 at 
the 5 per cent significance level only if t > 1.65. To attain a 50 per cent 
chance of detecting a positive premium then requires a sample of size 
n = (1.65a/u)?, which is a function of u and о. If, for example, the 
compound rate G is as high as the US stock estimate, i.e. 9 per cent, anda 
is a relatively low 0.015, then supposing м. = log (1.09)/250 gives п = 5147. 
Even if we try to test E[R] = 0 by testing и + 307 = 0, we need n = (1.65o/ 
[u + 307])? = 2946 returns for а 50 per cent chance of rejecting the null 
hypothesis. It is quite clear that long series will not guarantee decisive 
results about the existence or otherwise of a risk premium. 

As anticipated, empirical studies do not provide a clear conclusion. 
Dusak (1973) could find no premium for US corn, soybeans, and wheat 
futures between 1952 and 1967. However, Bodie and Rosansky (1980) 
estimated the average annual return from 23 US agricultural and mineral 
futures to be 14 per cent between 1950 and 1976. The 14 per cent figure is 
an average of (simple) annual returns from ungeared long positions. It 
includes interest on Treasury Bills worth about 4 per cent per annum. The 
implied premium is therefore 10 per cent, and its standard error is appar- 
ently 4 per cent. Thus the premium appears to be statistically significant. 
Nevertheless, the choice of years (1950 to 1976) may be fortuitous. There 
were high returns at both ends of the period. If 1950-1 and 1969—76 are 
discarded, the 14 per cent average falls to less than 5 per cent for 1952 to 
1968 inclusive. 

Table 2.4 gives the values of the annual compound returns G for my 
futures prices. The cocoa and coffee figures are high, 21 per cent and 15 per 
cent respectively fora year's prices per contract. Averages of simple annual 
returns are higher: 33 per cent for cocoa, 24 per cent for coífee, and 34 per 
cent for sugar, calculated using equation (2.3.2). The test for a premium 
using t = V/nx/s only gives significant values for cocoa, at the 5 per cent 
significance level. There is no evidence fora premium on currency futures. 
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2.4 STANDARD DEVIATIONS 


Returns often appear to have fluctuating standard deviations. Many refer- 
ences could be given, for example Merton (1980, p. 353) and Ali and 
Giaccotto (1982) for US stocks, and Labys and Thomas (1975) and Praetz 
(1975) for futures. Table 2.5 shows the considerable variation from year-to- 
year in the standard deviations of futures. Figure 1.4 (p. 7) emphasizes this 
variation by comparing the price histories of two sugar contracts. 


Table 2.5 Contract standard deviations 


12 months used per contract, standard deviation x 100 tabulated 


Year Corn Cocoa Coffee Sugar Wool 
1961 — — — 0.91 -- 
1962 — — — 1.51 — 
1963 0.82 — — 3.43 — 
1964 0.52 — — 2.39 — 
1965 0.42 — — 1.56 — 
1966 0.97 — — 1.33 0.54 
1967 0.82 — — 2.75 0.56 
1968 0.69 — — 1.80 0.64 
1969 0.77 — — 1.66 0.41 
1970 1.18 — -- 1.04 1.25 
1971 1.42 1.17 0.49 1.02 1.00 
1972 0.85 1.21 0.60 2.52 2.33 
1973 2.44 2.01 0.88 1.49 3.08 
1974 2.05 2.41 1.77 3.89 1.65 
1975 1.72 2.11 2.37 4.35 0.77 
1976 1.22 1.76 2.25 2.09 0.66 
1977 — 3.05 3.91 1.47 0.54 
1978 — 1.71 2.77 1.44 0.27 
1979 — 1.38 1.98 1.56 — 
1980 — 1.30 1.90 3.61 — 
1981 = 1.32 3.07 2.48 = 
6 months used per contract, s.d. x 100 tabulated 
Sterling Deutschmark Swiss Franc 
Year June Dec. June Dec. June Dec. 
1974 0.76 0.38 0.77 0.62 0.87 0.68 
1975 0.41 0.40 0.62 0.50 0.76 0.53 
1976 0.51 0.71 0.36 0.29 0.40 0.39 
1977 0.39 0.42 0.29 0.38 0.30 0.42 
1978 0.57 0.64 0.62 0.77 0.80 1.04 
1979 0.48 0.71 0.50 0.49 0.65 0.78 
1980 0.67 0.52 0.68 0.51 0.92 0.65 
1981 0.66 0.99 0.85 0.91 0.97 0.96 


Reproduced from Taylor (1985) by permission of Chapman and Hall Ltd. 
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Tests for changes in standard deviation (or variance) usually specify 
normal distributions or strict white noise in the null hypothesis. Neither 
assumption is valid, as we shall soon see, hence tests are not straight- 
forward. Returns could come from a stationary process yet appear to have 
fluctuating standard deviations. We explore the reality or illusion of the 
fluctuations in Chapters 3 and 4. Until then, we concentrate on elementary 
comparisons. 


Risks compared 


Standard deviations measure the speed at which prices are changing. Alow 
standard deviation will mean that the chance of a large price fall is relatively 
low. Thus the risks from two alternative investments can be contrasted by 
comparing their standard deviations. Investors ought to seek an expected 
return proportional to the risk accepted. When many investments can be 
combined into a portfolio then covariances with a market portfolio are a 
more important risk measure than individual variances. Calculating 
covariances is very difficult when stocks are not the only possible type of 
investment. Consequently, we will use the standard deviation (or variance) 
as a simple risk measure. 

French (1980, p. 58) gave standard deviations for Standard and Poor's 
500-share index from 1953 to 1977. Perry (1982, p. 863) and Kon (1984, p. 151) 
provide figures for the stocks of several large US companies whilst Brown 
and Warner (1985, p. 9) give an average standard deviation for a very large 
sample of randomly selected securities. Perry, Kon, Brown, and Walker all 
consider returns from 1962 until some date between 1977 and 1980. Using 
the US estimates and Tables 2.3 and 2.4 we can try to rank the riskiness of 
various investments, as follows. Obviously all figures are approximate. 


Investment Standard deviation of returns 
Currencies 0.006 
Standard and Poor's 500 0.007 
Financial Times 30 0.013 
US stocks (large) 0.015 
Gold and metals 0.018 
London agricultural futures 0.022 
US stocks (random) 0.027 


To see how these figures tell us something about risk, consider $1 
invested for one year in a futures contract. Suppose the daily returns have 
zero mean, standard deviations as above and are uncorrelated. Also 
suppose there are 250 of them in a year and their total has a normal 
distribution. Then it can easily be shown that 95 per cent of the time a 
currency investment will be worth between $0.83 and $1.20 at the end of 
the year, compared with a range from $0.51 to $1.98 for a commodity like 
sugar. 
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Futures and contract age 


It has been said that the standard deviation of futures returns depends оп 
the age of the contract. As the delivery date comes closer, the standard 
deviation is believed to increase. Rutledge (1976) provided evidence in 
favour of this hypothesis but only studied one contract for each of four 
commodities. 

The hypothesis can be assessed by comparing the standard deviations 
for subsets of a time series. Each contract is split in half and standard 
deviations calculated, say s,, and s,; for contract í. Thus if a series takes 12 
months of prices from contract i, s; is the standard deviation over the 
period starting 13 months and finishing 7 months before the delivery date, 
with sp the standard deviation over times between 7 and 1 months before 
the contract terminates. Some contracts contribute less prices to a series 
than the standard number, usually because of the dates on which the series 
begins and ends. Such contracts have been excluded from the following 
calculations. The geometric mean g of the ratios f, = 5,2/5;1 estimates the 
increase, if any, in the standard deviation in the second period relative to 
the first. Now, for k contracts, 


k 
g-fh...f, and log (g) = У log (£)/k. 
1-1 


When s, апа s; һауе identical distributions, random variables log (f) have 
zero expectations. A value of g is therefore significantly different from 1 if 
the k numbers log (4) have a sample mean significantly different from zero. 

A standard, one-tailed Student's t test has been used with a 5 per cent 
significance level. The following table presents the geometric means for 
the 3, 6, and 12 months per contract series. 


Months used per contract 


Futures 3 6 12 
Corn 0.84 1.04 1.43 
Cocoa 0.98 1.01 1.46 
Coffee 0.93 0.94 1.77 
Sugar 1.02 1.09 1.02 
Wool — — 0.98 
Sterling — 1.04 — 
Deutschmark == 1.03 - 
Swiss franc — 0.99 — 


(Reproduced from Taylor (1985) by permission of Chapman and Hall Ltd.) 


The values of g are only significantfor the 12 month corn, cocoa, and coffee 
series. Indeed they would be significant at the 1 per cent level and each 
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series has at most two years in which s;, exceeded s;. All the other series 
have geometric means very close to 1. Therefore the standard deviation 
does not increase systematically during the final six trading months. 


2.5 CALENDAR EFFECTS 


Day-of-the-week 


Most daily returns measure the return during 24 hours. Returns from 
Friday's close to Monday's close, however, represent the result of an 
investment for 72 hours. We might therefore expect returns calculated on 
Mondays to have a distribution differing from that of 24 hour returns. 
Stocks could be expected to have higher average returns on Mondays than 
on other days. Variances could be higher for Monday returns as more new 
information should appear in 72 hours than in 24. The empirical evidence 
is rather surprising. 


Stocks 


French (1980) calculated daily returns on the Standard and Poor's stock 
index between 1953 and 1977. The daily means and standard deviations are 
instructive. 


Monday Tuesday Wednesday Thursday Friday 


Mean (x 100) —0.168 0.016 0.097 0.045 0.087 
Standard deviation (x 100) 0.843 0.727 0.748 0.686 0.660 
Observations 1170 1193 1231 1221 1209 


Monday's negative mean is very highly significant when a t-test for a zero 
population mean is performed. However, the assumptions for such a test 
(which include independent and identical distributions on all Mondays) 
will not be valid. French also shows that Monday's mean is negative for 
each of five sub-periods of five years and is negative for 20 of the 25 years. 
Gibbons and Hess (1981) looked at the 30 stocks in the Dow Jones index and 
discovered that every stock had a negative mean return on Mondays 
between 1962 and 1978. Articles by Keim and Stambaugh (1984) and 
Rogalski (1984) are also relevant. Beyond reasonable doubt, Monday prices 
have been determined in a strange way. No satisfactory explanation has yet 
been given for Monday's negative mean. 


Currencies 


MacFarland et a/. (1982) investigated various spot and forward currency 
prices recorded between 1975 and 1979. Contracts denominated in dollars 
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rose on Mondays and Wednesdays and fell on Thursdays and Fridays 
during the four-year period. It is not easy to determine the significance of 
these results because again the assumption of independent and identical 
distributions will not be valid. MacFarland et al. attribute the Wednesday 
rises and Thursday falls to the clearing system for these currency contracts 
(p. 695). 

My currency futures give the following mean returns for the eight-year 
period from 1974 to 1981. 


Mean x 100 Monday Tuesday Wednesday Thursday Friday 


£/$ 0.026 —0.012 0.115 —0.024 —0.058 
DM/$ 0.014 —0.021 0.105 —0.104 -0.009 
SF/$ 0.036 — 0.035 0.155 —0.118 -0.000 


A long position in all three currencies would have averaged a 0.12 per cent 
gain from the Tuesday close to the Wednesday close followed by an 
average loss of 0.08 per cent over the next 24 hours. Student t-tests reject 
the hypothesis of zero average returns on Wednesdays and Thursdays for 
five of the six possible tests, at the 1 per cent significance level; but again 
the test assumptions are dubious. 


Agricultural futures 


There is not strong evidence for daily effects in the mean returns from 
corn, cocoa, coffee, and sugar futures. The Friday averages for cocoa, 
coffee, and sugar are, however, all about 0.20 per cent. Roll (1984) reports 
daily averages for orange juice, only Monday's is negative. 


Standard deviations 


French's figures show clearly that the standard deviation for Monday's 72 
hour return is more than that for24 hourreturns, at least for US stocks. The 
Monday standard deviation is about 1.2 times the standard deviation on 
other days. A far higher ratio of 1.73 ... = V3 is expected if prices follow а 
standard diffusion process in calendar time. Clearly they do not. 

To compare standard deviations for specific days of the week, without 
having to model longer term changes in the standard deviation, | use the 


following method. Suppose Z, 2,1, ..., Z+5 are prices from a Friday 
close to the next Friday close, let x4; = log (Z,4;/Z;4;-1) as usual, and let 
Wi; = X? (а + Хх?) + +++ + х2, 5). Then, for each day of the week, find 


the average value of wover all occasions when the market was open for six 
consecutive trading days commencing with a Friday. The five averages 
obtained, one for each day of the week, estimate the proportion of the total 
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variance in a week due to each day. These estimated proportions are as 
follows for various futures series. 


Percentage proportions of a week's variance 


Monday Tuesday Wednesday Thursday Friday 


Corn 28.2 18.5 18.9 17.7 16.7 
Cocoa 21.4 21.7 19.8 18.5 18.6 
Coffee 22.8 18.7 19.9 18.7 19.9 
Sugar 22.2 20.5 20.2 19.5 17.5 
£/$ 23.1 20.6 20.1 17.8 18.4 
DM/$ 26.0 19.9 19.6 18.6 16.0 
SF/$ 24.0 18.6 19.4 20.8 17.3 
Average 24.0 19.7 19.7 18.8 17.8 


Obviously Monday returns have higher standard deviations than other 
returns. The average variance proportion is 24 per cent for Mondays and 
about 19 per cent on the other four days. Thus the ratio of Monday's 
standard deviation to other days' standard deviation is about \/(0.24/0.19) 
or 1.12. 


Month-of-the-year effects for stocks 


Average returns from stocks are significantly higher in some months than 
others. Rozeff and Kinney (1976), Keim (1983), and Tinic and West (1984) 
have shown US returns are particularly high at the start of the new tax year 
in January. Praetz (1973) and Officer (1975) had previously shown Australian 
return distributions depend upon the month, whilst Gultekin and Gultekin 
(1983) have documented monthly means for 17 countries. Small firms earn 
higher returns on average than predicted by a market-factor model (Banz, 
1981) and these anomalous excess returns tend to occur early in January 
(Keim, 1983). Schwert (1983) summarizes several papers about the firm size 
and January effects. Schultz (1985) provides empirical evidence that per- 
sonal taxes are related to the January effect. 

The magnitude of the January effect varies across companies. An 
average-risk US company appears to have returned 4 per cent per month 
on average in January between 1935 and 1982 butonly 1 per centon average 
in the other months (Tinic and West, 1984, p. 572). In daily terms the 
average difference between January and other returns is then 0.15 percent, 
slightly less in magnitude than the estimated average difference between 
Monday and other returns. 

Although calendar effects are puzzling from a theoretical viewpoint, 
fortunately they have negligible consequences for tests based upon 
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autocorrelation coefficients calculated from daily returns. This is due to the 
high variability of returns. Theoretical arguments are summarized in an 
appendix to this chapter (Section 2(A)) and supported by simulations in 
Section 6.11. 


2.6 SKEWNESS 


Skewness statistics are used to assess the symmetry of distributions. Nearly 
all the estimates of b, defined by equation (2.3.1) and presented in Tables 
2.3 and 2.4, are close to zero. This shows that the sample distributions are 
approximately symmetric. The standard error of an estimate b calculated 
from n returns depends on n and the population distribution. It is V(6/n) 
for a random sample from a normal distribution. This formula is, however, 
of little value for returns, because their distributions have high kurtosis. 
One or two large observations, generally called outliers, can be respon- 
sible for an apparently large skewness estimate. This occurs for my coffee 
returns based on 12 months' data per contract. An outlier 13.8 standard 
deviations from the mean is mainly responsible for the estimate b — 1.42. 

There is some evidence that US stock returns are positively skewed and 
that metals returns are negatively skewed. Considering US stocks, 14 of my 
15 skewness estimates are positive. Perry (1982) has 33 out of 37 positive for 
longer series, all but one having about 3900 returns. The average value of 
Perry's skewness estimates is 0.13. 


2.7 KURTOSIS 


Normal distributions have kurtosis equal to 3. Sample estimates of the 
kurtosis (denoted by k) are nearly always far greater than 3; all my estimates 
exceed 3.5 and a majority of them have K > 6. The standard error of an 
estimate is V(24/n) for Gaussian white noise. Every estimate given here 
exceeds 3 by more than four of these standard errors. It is very clear that the 
returns generating process is not even approximately Gaussian. 

High values of K are caused by more observations several standard 
deviations from the mean than predicted by normal distributions. Only 
one observation in 15800 is more than four standard deviations away from 
the mean of a normal distribution. The frequency of this event is 1 in 293 for 
my US stock returns, 1 in 138 for the metals, and 1 in 156 for the agricultural 
futures (12 months’ data used per contract). It appears that metals and 
agricultural goods have more kurtosis and more extreme outliers than 
currencies and US stocks. Tables 2.6 and 2.7 describe the numbers and 
magnitudes of the outliers. 
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Table 2.6 Number of outliers, spot series 


Frequency of event: |x, — Х| > ks? 


k= 

Series Returns 2 3 4 5 6 7 8 9 
Stocks 
Allied 2750 145 45 10 2 1 
Alcoa 2750 144 34 10 5 3 2 
Am. Can 2750 151 35 4 3 1 
ATandT 2750 147 29 9 4 3 
Am.Brands 2750 146 35 14 2 1 
Anaconda 2750 137 42 17 6 5 4 1 
Bethlehem 2750 139 31 12 4 
Chrysler 2750 128 40 14 6 2 
Dupont 2750 158 29 7 1 
Kodak 2750 133 32 5 2 1 1 1 
G. Electric 2750 154 34 7 1 
G. Food 2750 140 30 7 3 1 
C. Motors 2750 150 33 9 4 3 2 
G.Telephone 2750 141 37 7 1 
Harvester 2750 147 33 9 3 1 
Stock index 
FT30 1794 79 8 1 
Metals 
Gold 1957 110 27 9 7 2 1 1 1 
Silver 1098 51 18 11 4 3 1 
Copper 3970 204 75 29 15 5 3 1 
Lead 3025 155 54 16 7 2 1 1 1 
Tin 3025 151 61 27 8 4 1 
Zinc 3025 167 56 25 16 10 4 1 1 
Currency 
£/$ 1997 109 29 12 6 2 1 
ах, = return оп day f, X = average, s = standard deviation. 


Only the non-zero frequencies are shown in the table. 


2.8 PLAUSIBLE DISTRIBUTIONS 


A satisfactory model for daily returns must have a probability distribution 
similar to the observed distribution. Symmetric distributions have usually 
been considered adequate and we will follow this convention. Thus we are 
interested in plausible, symmetric distributions having high or possibly 
infinite kurtosis. A literature review now follows; distributions are fitted to 
my data in the next chapter. 

Normal distributions would be expected if the (log) price changes within 
a trading day are characterized by: (1) finite variance, (2) identical distribu- 


46 Modelling financial time series 


Table 2.7 Number of outliers, futures series 


Frequency of event: |x, — x| > ks? 


Series k= 
(months) Returns 2 3 4 5 6 7 8 9 10 11 


Commodities 


Corn (12) 3195 238 55 6 2 2 1 1 1 

Cocoa (12) 2684 151 37 7 2 2 1 1 1 1 
Coffee (12) 2728 127 50 23 7 4 3 2 2 1* 
Sugar (12) 5302 296 106 35 12 2 2 

Wool (12) 3270 162 74 41 20 10 5 4 2 2 
Financial 

£/$ (6) 1990 134 30 11 1 

DM’/$ (6) 1990 128 24 2 1 1 

SF/$ (6) 1990 109 21 2 1 

T-bond (—) 518 20 4 1 


?x, = return on day t, x = average, s = standard deviation. 
° Months of returns used per futures contract. 

“Represents return equal to x + 13.85. 

Only the non-zero frequencies are shown in the table. 


tions, (3) the same number of them each day, and (4) this number is large. 
These conditions ensure an asymptotic normal distribution for an infinite 
number of within-day changes. Normal distributions are not observed 
even when characteristic (4) occurs. Most derivations of plausible distribu- 
tions either suppose (1) is false or at least one of (2) and (3) is false. 

Mandelbrot (1963) and Fama (1965) supposed that any sum of (log) price 
changes belongs to the same parametric family of probability distributions. 
They do not assume finite variance. This leads to the stable or Pareto-Levy 
distribution, defined for a symmetric variable X, by its characteristic 
function, 


Elexp GkX)] = idk — (y|k|) “ (2.8.1) 


with 2 = —1 and К any real number. The parameters of this symmetric 
distribution are the characteristic exponent а, a dispersion parameter y, 
and a location parameter ё. If a process {X,} is strict white noise then any 
total X, + X,,, + +++ + X will also have characteristic exponent а. 

Fama and Roll (1968) describe a practical method for estimating a. 
Estimates are always between the special cases « — 1 for Cauchy distribu- 
попѕапа с = 2 for normal distributions. Many researchers find the conclu- 
sion of infinite variance, when « « 2, unacceptable. Detailed studies of 
stock returns have conclusively rejected the stable distributions (Blattberg 
and Gonedes, 1974; Hagerman, 1978; Perry, 1983). Hagerman, forexample, 
shows that estimates of a steadily increase from about 1.5 for daily returns 
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to about 1.9 for returns measured over 35 days. Returns over a month or 
more have distributions much closer to the normal shape than daily 
returns. A decade after his 1965 paper, Fama prefers to use normal 
distributions for monthly returns and so to discard stable distributions for 
daily returns (Fama, 1976, Ch. 1). 

There is no reason to suppose that the number of price changes within 
each day is identical. Neither is it particularly reasonable to assume that 
every change in price has the same variance. Consequently, Praetz (1972), 
Clark (1973), and many others have argued that observed returns come 
from a mixture of normal distributions. The random variable X, then has 
conditional distributions, 


X [ex ~ Niu, Қа) (2.8.2) 


for various possibilities аж; is supposed to be constant and Қа») is the 
conditional variance. The quantity о, could be trading volume with 
Қо) = Aw?, A, B > 0 as in Clark (1973). Granger and Morgenstern (1970), 
Rogalski (1978), and Rutledge (1979) give further examples of the depen- 
dence of returns distributions upon trading volume. Or e, could measure 
the number of new pieces of relevant information arriving at the market on 
day t, with f(a) = A + Во, A > 0, B > 0 (e.g. Beckers, 1981a; Tauchen 
and Pitts, 1983). Tauchen and Pitts' paper is particularly novel and offers 
joint distributions for returns and volume conditional upon the amount of 
new information and the number of traders. 

However we choose to interpret а), the conditional variance f(«) will 
have some distribution and hence X, will have a particular unconditional 
distribution. Let V? denote the conditional variance on day t. Two distribu- 
tions for V? can justify serious consideration. 

First, Praetz (1972) suggested the inverted gamma, for which the density 
function of V? is a constant times 


—(k+2) а= (12 k-1) о/м? 
v , 


e 


o? and k > 2 being the parameters of the distribution. The great advantage 
of the inverted gamma is a mathematically convenient density for returns. 
This is a Student's t density, with k degrees of freedom, defined by 


hix) = КК — 277g BG, ЮГІП + (x — &Y/((k — 2)9?)] Cn, 
(2.8.3) 


The term BG, šk) is a Beta coefficient. The density h(x) has mean д and 
variance o2 with a shape determined by К. Values of k x 4 give infinite 
kurtosis, whilst for k > 4 the kurtosis is 3 + 6/(k — 4). As k — о, h(x) 
converges to a normal density. Blattberg and Gonedes (1974) use likeli- 
hood-ratio and other methods to show that Student distributions fit US 
stock returns far better than stable distributions. For 30 stocks, 8 maximum- 
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likelihood estimates K were less than 4, and 16 estimates were between 4 
and 6. Kon (1984) has all 33 estimates K between 3 and 6. 

Second, Clark (1973) supposed that the conditional standard deviations 
and variances have lognormal distributions. If log (М) ~ N(a, B?) then V, 
has density 


[27)78v]^! exp {—3(log v — a)?/g?). (2.8.4) 


All the moments of the returns distribution are then finite, however the 
density function must be expressed as an integral. Despite this difficulty, 
the lognormal permits the calculation of various relevant statistics. As 
log (V) = r log (М) ~ Ntra, r282) and ҒІУД = exp (ra + 37282), itcan be 
shown that the returns have variance E[V?] = exp (2a + 28?) and kurtosis 
3E[VtO/ELV2]? = 3 exp (48?). Further results for these distributions appear 
in the next chapter. 


2.9 AUTOCORRELATION 


The correlation between returns separated by a time-lag of 7 days can be 
estimated from n observations by the sample autocorrelation coefficient: 


Gy = Y (x, — X)60q.. — 9/5 (х, — х)?, т> 0. (2.9.1) 
t=1 t=1 


Alternative formulae have sometimes been used by other writers but the 
coefficients obtained are virtually identical for long series. There are a few 
more terms in the denominator of r, , than in the numerator. This causes a 
slight bias. When we want to avoid it we use the adjusted coefficient 
rey = пг, Кп — т). The unadjusted coefficient always has variance less 
than 1/n fora strict white noise process but this is not true for the adjusted 
coefficient. Finite sample properties of the coefficients are discussed in 
Section 6.3. 

Equation (2.9.1) should be altered for futures series. Suppose k contracts 
contribute returns to the series. Then (k — 1)7 of the products (x, — x) 
(ха, — X) contain two returns calculated from different contracts, for 
sufficiently low т. These (К — 1)7 mixed products cannot be expected to 
have the same distribution as the remaining n — Кт products. Con- 
sequently, for futures series, | omit the (k — 1)т mixed products from the 
numerator of (2.9.1). Every remaining product is then calculated from two 
returns upon the same contract. The adjusted coefficient now becomes 
гуу = nr,,/(n — Кт). 

Autocorrelation formulae assume constant expected returns, E(X,), esti- 
mated by x. Changes, if any, in expected returns are not large enough to 
make r,, a seriously biased estimate of the population correlation, 
cor (X,, Xa+). This is shown in Appendix 2(A) for day-of-the-week effects, 
whilst inflation and other determinants of expected returns are covered in 
Section 6.11. 
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Autocorrelation coefficients r,, have been calculated for all lags т 
between 1 and 30 trading days inclusive. These coefficients are all close to 
zero. To summarize the signs and magnitudes of the coefficients each is 
assigned to one of six classes. These are: 


(1) r < —0.1 (2) —0.1 < r < —0.05 (3) —0.05 < r < 0 
(4) 0 < r < 0.05 (5) 0.05 < rs 0.1 (6) 0.1 < r 


Table 2.10 gives, by series, the coefficient at a lag of one day and the 
number of the 30 coefficients in each class. The information is here 
summarized for various sets of series using the six classes, firstly for 7 — 1 
and secondly for1 < т < 30. 


First-lag 


All the first-lag coefficients are positive for the US stock returns between 
1966 and 1976. The coefficients range from a minimum of 0.037 for Kodak to 
a maximum of 0.150 for Alcoa. Fama (1965) obtained 22 positive and 8 
negative first-lag coefficients for US stocks between 1957 and 1962; Greene 
and Fielitz (1977) found 137 positive and 63 negative between 1963 and 
1968; and Perry (1982) has all 37 first-lag coefficients positive for longer 
series, from 1962 to 1977. It therefore appears that US stock returns on 
consecutive days are positively correlated. Care is required, however, asall 
the returns depend on a common market factor and so the coefficients of 
different stocks are not independent of each other. More about this in the 
appendix to Chapter 6. 

Stock indices can display more first-lag correlation than the individual 
stocks, because ofthe market factor and thin trading of smaller companies. 
Officer (1975), Scholes and Williams (1977), and Gibbons and Hess (1981) 
provide numerical evidence. 

Seven of the eight long futures series used for Table 2.8 have a positive 
first-lag coefficient but the pattern is reversed for the metals; they only 
have one out of six positive. 


Table2.8 Frequencies of first-lag autocorrelations 


Class 
Setofseries 1 2 3 4 5 6 
15US stocks 0 0 0 3 8 4 
6 metals 0 4 1 1 0 0 
8futures? 0 0 1 6 1 0 
31 various? 0 4 2 12 9 4 


?Agriculturals, 12 months per contract, and currencies. 
515 stocks, 6 metals, 8 futures, FT30, and £/$ spot. 
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Lags 1 to 30 


Less than 10 per cent of the coefficients exceed 0.05 in absolute value and 
only about 1 per cent of the absolute values exceed 0.1. These low 
frequencies emphasize how small the sample autocorrelations are. Any 
dependence in the stochastic process generating returns must be small. 

There is a notable preponderance of negative coefficients for the US 
stocks for lags of two days or more. Fifty-six per cent of the coefficients are 
negative for lags 2 to 30 inclusive. Fama (1965) notes the same negative 
tendency, for lags 2 to 10, using returns recorded for years preceding those 
studied here. The day-of-the-week effects, particularly negative Monday 
returns, might help to explain these results. Some calculations are given in 
Appendix 2(A). 

A sharp contrast is provided by the futures coefficients. Consider the 240 
coefficients for the eight series used in Table 2.9. For these coefficients, 57 
per cent are positive at lags 2 to 30, and recall that 7 out of 8 are positive at 
lag 1. Of the 20 coefficients either above 0.05 or below —0.05, 17 are 
positive and only 3 are negative. Just 1 of the 17 largest positive coefficients 
is at a lag of one day. The reality and importance of small, positive 
autocorrelation at several lags will be looked at in depth throughout 
Chapters 6, 7, and 8. 


Tests 


The autocorrelation coefficients r,, could be used to test whether the 
generating process {X,} is strict white noise. This will not be done as far 
more conclusive results can be obtained from transformed returns, as we 
will see in the next section. 

Of course the really interesting hypothesis is not strict white noise but 
uncorrelated X,. Tests for zero correlation cannot use the r,, and а 


Table 2.9 Frequencies of autocorrelations at lags 1 to 30 


Class 
Set of series 1 2 3 4 5 6 
15 US stocks 0 9 234 191 12 4 
(0%) (2%) (52%) (42%) (3%) (1%) 
6 metals 1 13 63 86 14 3 
(396) (796) (3596) (4896) (896) (12%) 
8futures 1 2 97 123 17 0 
(396) (1%) (40%) (51%) (7%) 
31 various 2 25 414 439 43 7 


(0.296) (2.7%) (44.5%) (47.2%) (4.6%) (0.8%) 
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Table 2.10 Autocorrelations r, , for returns 


Lags 1-30, frequency by class? 


Lag 1 

Series ix 1 2 3 4 5 6 
Allied 0.086 0 1 14 13 2 0 
Alcoa 0.150 0 1 16 12 0 1 
Am. Сап 0.144 0 0 18 11 0 1 
ATandT 0.088 0 0 19 10 1 0 
Am. Brands 0.093 0 2 18 9 1 0 
Anaconda 0.057 0 0 18 11 1 0 
Bethlehem 0.105 0 1 18 10 0 1 
Chrysler 0.045 0 1 11 18 0 0 
Dupont 0.114 0 0 11 18 0 1 
Kodak 0.037 0 0 15 14 1 0 
G. Electric 0.099 0 1 16 11 2 0 
G. Food 0.089 0 0 15 14 1 0 
G. Motors 0.050 0 1 13 15 1 0 
G. Telephone 0.052 0 0 19 10 1 0 
Harvester 0.086 0 1 13 15 1 0 
FT30 0.033 0 0 11 19 0 0 
Gold —0.052 0 2 10 14 4 0 
Silver —0.080 1 7 3 12 4 3 
Copper —0.052 0 1 10 19 0 0 
Lead —0.020 0 0 12 17 1 0 
Tin 0.020 0 1 14 14 1 0 
Zinc —0.073 0 2 14 10 4 0 
£/$, spot 0.038 0 1 9 20 0 0 
Corn (12) —0.026 0 2 14 10 4 0 
Corn (6) 0.005 0 0 12 15 3 0 
Corn (3) 0.013 0 0 13 15 2 0 
Cocoa (12) 0.016 0 0 11 16 3 0 
Cocoa (6) 0.035 0 0 11 15 4 0 
Cocoa (3) 0.071 0 0 11 16 3 0 
Coffee (12) 0.026 0 0 15 12 3 0 
Coffee (6) 0.062 0 0 16 11 3 0 
Coffee (3) 0.062 0 1 15 11 2 1 
Sugar (12) 0.044 0 0 9 21 0 0 
Sugar (6) 0.064 0 0 6 23 1 0 
Sugar (3) 0.064 0 0 7 22 1 0 
Wool (12) 0.075 1 0 15 11 3 0 
£/$ (6) 0.034 0 0 8 22 0 0 
DM/$ (6) 0.011 0 0 13 15 2 0 
SF/$ (6) 0.049 0 0 11 17 2 0 
T-bond —0.093 0 4 15 7 4 0 


a The six classes аге (1) r < —0.1, (2) —0.1 < r < —0.05, (3) —0.05 < r < 0, (4) 0 < r = 0.05, 
(5)0.05<г<0.1,(6):>0.1. 
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conventional standard error of 1/\/п without risking dubious conclusions. 
This assertion will be justified in Chapter 5 and be strongly implied by the 
following results. 


2.10 NON-LINEAR STRUCTURE 


Not strict white noise 


Recall that a process {X,} is strict white noise (SWN) if the X, are indepen- 
dently and identically distributed. Also remember that the distribution of 


= S (X-X X» ~ 0 / 5 o = Xy 
t=1 t=1 

is approximately N(0, 1/n) for finite-variance SWN (Section 1.10). Most of 
the first-lag coefficients listed in Table 2.10 have Vn|r,,| > 2, suggesting 
that the returns series are not realizations of SWN. 

To establish this conclusion more decisively we introduce the idea of 
transforming returns and then finding autocorrelations, first suggested by 
Granger and Andersen (1978). For if (X,) is a SWN process so too are the 
processes {|X,|} and (X7). The first lag coefficients of observed absolute 
returns |х, and squared returns x? are denoted by 


ШЕН апа LENT 


They are listed in Tables 2.11 and 2.12. Ignoring the short T-bond series, we 
can deduce that every other series has г > 4.6/V/n and rx  3.5/V/n. 
The standard error of a coefficient calculated from absolute returns will be 
туп if {X,} is finite-variance SWN; the same standard error is applicable 
for the coefficients of squared returns providing the X, also have finite 
kurtosis. Finite variance is certainly a very reasonable conclusion. Thus the 
high correlation observed between |x| and |х,;|| proves conclusively that 
the returns process is not SWN. Note that this tells us nothing about the 
correlation between returns. 


A characteristic of returns 


Coefficients r, „| and г, > for lags тир to 50 days are nearly always positive 
and larger than г, ,, the autocorrelations of the returns. This can be seen by 
comparing Table 2.10 with Tables 2.11 and 2.12. Table 2.10 shows that less 
than 10 per cent of the returns coefficients are outside the range —0.05 to 
0.05, for lags 7 from 1 to 30. By contrast, most of the coefficients for data |x| 
and x; are greater than 0.05; the proportions for squared returns are 58 per 
cent for US stocks, 85 per cent for the metals and 84 per cent for the futures 
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Table 2.11 Autocorrelations r, ,. for absolute returns 


Lags 1—30, frequency by class 


Lag 1 

Series Fi ix] 1 2 3 4 5 6 
Allied 0.204 0 0 0 1 25 4 
Alcoa 0.194 0 0 0 3 18 9 
Am. Can 0.107 0 0 0 14 15 1 
AT and T 0.168 0 0 5 17 6 2 
Am. Brands 0.165 0 0 0 20 7 3 
Anaconda 0.144 0 0 0 17 12 1 
Bethlehem 0.130 0 0 3 19 7 1 
Chrysler 0.127 0 0 0 9 18 3 
Dupont 0.202 0 0 0 0 7 23 
Kodak 0.146 0 0 0 0 16 14 
G. Electric 0.195 0 0 0 0 0 30 
G. Food 0.196 0 0 0 2 20 8 
G. Motors 0.174 '0 0 0 0 20 10 
G. Telephone 0.154 0 0 0 1 24 5 
Harvester 0.175 0 0 0 3 24 3 
FT30 0.109 0 0 0 3 19 8 
Gold 0.294 0 0 0 0 0 30 
Silver 0.284 0 0 0 0 0 30 
Copper 0.280 0 0 0 0 0 30 
Lea 0.336 0 0 0 0 0 30 
Tin 0.321 0 0 0 0 0 30 
Zinc 0.398 0 0 0 0 0 30 
£/$, spot 0.271 0 0 0 0 1 29 
Corn (12) 0.385 0 0 0 0 0 30 
Corn (6) 0.407 0 0 0 0 0 30 
Corn (3) 0.415 0 0 0 0 0 30 
Cocoa (12) 0.281 0 0 0 0 1 29 
Cocoa (6) 0.262 0 0 0 0 7 23 
Cocoa (3) 0.247 0 0 0 7 7 16 
Coffee (12) 0.374 0 0 0 0 0 30 
Coffee (6) 0.394 0 0 0 0 0 30 
Coffee (3) 0.449 0 0 0 0 0 30 
Sugar (12) 0.311 0 0 0 0 0 30 
Sugar (6) 0.298 0 0 0 0 0 30 
Sugar (3) 0.273 0 0 0 0 8 22 
Wool (12) 0.461 0 0 0 0 0 30 
£/$ (6) 0.234 0 0 0 1 9 20 
DM/$ (6) 0.241 0 0 0 0 7 23 
SF/$ (6) 0.242 0 0 0 0 1 29 
T-bond 0.050 0 0 2 10 12 6 
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Table 2.12  Autocorrelations r, ,; for squared returns 


Lags 1-30, frequency by class 


Lag 1 

Series LE 1 2 3 4 5 6 
Allied 0.229 0 0 0 7 18 5 
Alcoa 0.144 0 0 0 11 14 5 
Am. Сап 0.075 0 0 0 21 9 0 
ATandT 0.131 0 0 11 16 2 1 
Am. Brands 0.175 0 0 3 21 4 2 
Anaconda 0.068 0 0 6 22 2 0 
Bethlehem 0.107 0 0 3 21 4 2 
Chrysler 0.113 0 0 1 15 13 1 
Dupont 0.194 0 0 0 0 12 18 
Kodak 0.178 0 0 0 3 16 11 
C. Electric 0.191 0 0 0 0 1 29 
G. Foods 0.189 0 0 0 9 15 6 
G. Motors 0.168 0 0 0 7 19 4 
G. Telephone 0.154 0 0 0 3 21 6 
Harvester 0.170 0 0 1 10 17 2 
FT 30 0.149 0 0 0 2 10 18 
Gold 0.164 0 0 0 5 10 15 
Silver 0.205 0 0 0 0 1 29 
Copper 0.227 0 0 0 4 19 7 
Lead 0.353 0 0 0 13 10 7 
Tin 0.240 0 0 0 5 9 16 
Zinc 0.291 0 0 0 0 1 29 
£/$, spot 0.150 0 0 0 12 15 3 
Corn (12) 0.342 0 0 0 0 0 30 
Corn (6) 0.328 0 0 0 0 0 30 
Corn (3) 0.336 0 0 0 0 0 30 
Cocoa (12) 0.180 0 0 0 7 14 9 
Cocoa (6) 0.189 0 0 0 10 15 5 
Cocoa (3) 0.170 0 0 0 17 9 4 
Coffee (12) 0.184 0 0 0 19 10 1 
Coffee (6) 0.352 0 0 0 11 15 4 
Coffee (3) 0.551 0 0 0 23 4 3 
Sugar (12) 0.238 0 0 0 0 3 27 
Sugar (6) 0.213 0 0 0 0 9 21 
Sugar (3) 0.188 0 0 0 Z 8 15 
Wool(12) 0.289 0 0 0 0 0 30 
£/$ (6) 0.178 0 0 0 9 17 4 
DM/$ (6) 0.269 0 0 0 2 15 13 
SF/$ (6) 0.284 0 0 0 1 9 20 
T-bond 0.031 0 0 5 11 11 3 
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Figure 2.1 Autocorrelations for the first US stock series, for returns (+), absolute 
returns (©), and squared returns (x) 


(excluding T-bonds). Figures 2.1 and 2.2 show the coefficients for returns, 
absolute returns and squared returns, for the first stock and the gold 
series. The dotted lines show +1.96/үл. 

The returns process is therefore characterized by substantially more 
correlation between absolute or squared returns than there is between the 
returns themselves. This means that large absolute returns are more likely 
than small absolute returns to be followed by a large absolute return, first 
noticed by Fama (1965). And, more generally, the distribution of the next 
absolute return can depend on several previous absolute returns. 

One possible explanation of the autocorrelation found in absolute and 
squared returns is changes in the variances of returns. This and another 
explanation, involving changes in conditional variances, are both 
examined in the next chapter. Day-of-the-week effects cannot explain the 
autocorrelation, shown in the appendix, Section 2(A). Neither can a linear, 
correlated, process provide a satisfactory explanation. To show this we 
need some straightforward but lengthy mathematics. The conclusion is 
very certain and the next subsection need not be read if the reader believes 
Tables 2.11 and 2.12 suffice to reject all linear models. 
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Figure 2.2 Autocorrelations for the gold series, for returns (+), absolute returns 
(0), and squared returns (X) 


Not linear 


A process (Х) is linear if there are constants и and b;, j > 0, and а 
zero-mean SWN process (ғ,) for which 


Х-и = > bjen, with by = 1. (2.10.1) 
j-0 
It will now be assumed that X, has finite kurtosis, к = E(X, — u)"/ 
(E(X, — u}? J. Let S, = (X, — А). 
Then, if {X,} is linear, and has autocorrelations р, x, it can be shown that 
(5,) has autocorrelations given by weighted averages 


EE БОЛАЙЫҚ; a, (2.10.2) 
! к-1 / к-1 


for certain constants a, determined by the Б). This is proved in the 
appendix, Section 2(B). When {X,} is Gaussian, к = 3 and р, ; = р? х, 
proved by Granger and Newbold (1976). Let Obethe proportional reduction 
in mean square error obtained by optimal forecasts, so 


= (var (X) — var (e) }/var (X). 
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Then the constants o, are constrained by 


(all 7 > 0), Y =, < 0/1 — 0). (2.10.3) 


т-1 


а, 2 


Consequently, for x > 3, 


K K 
0x Y ps maximum of р2х and 6/(1- 0). (2.10.4) 

т=1 
For returns we may assume that 0 < 0.05 because no one has been able to 
find a forecast anywhere near 5 per cent more accurate than the random 
walk forecast. Also, Table 2.10 shows that we may assume |p, x| < 0.15 and 
lp, x| < 0.1 for т > 1. As < > 3 for returns, a linear process will then have, 

by applying (2.10.4), 


30 
p.s < 0.06 апа У/р,; < 0.32. (2.10.5) 

т=1 
For every series, p, s has been estimated by the autocorrelations r, , of the 
data (x, — x)2; these estimates are almost identical to the autocorrelations 
r, x2. The totals rj, + --- + Bos estimate ру + --- + рзо 5 and range from 
0.48 to 4.91 for the stocks series and from 1.42 to 6.26 for the remaining 
series. All the totals > r,, exceed the upper bound for È p, ; given by 
(2.10.5), frequently by a considerable margin. Therefore linear processes 
cannot generate observed returns. Any reasonable model for returns must 

be non-linear. 


Consequences of non-linear structure 


Many hitherto standard methods of financial research appear unreliable 
once we appreciate that long series of returns are not generated by either 
a strict white noise or a linear process. Tests for zero autocorrelation 
between returns must begin by realizing that the process will not be strict 
white noise even if it is uncorrelated. Therefore the standard errors of 
coefficients r,, need not be 1/Vn. In fact the standard errors frequently 
exceed 3/\/n (Chapter 5), so new test procedures are necessary. Popular 
option pricing models assume at least strict white noise. Option pricing 
formulae can therefore probably be improved by using a better model for 
returns (Chapter 9). The parameters of the returns distribution, e.g. К, ш 
and с? for the Student's t distribution (Section 2.8), have been estimated by 
maximum-likelihood methods. However, the likelihood function can only 
be computed if, erroneously, strict white noise is assumed. 
Mathematicians and statisticians have researched linear processes for 
many years. All we need for hypothesis testing, model building, and 
forecasting, within the linear framework, has been published. Methods of 
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modelling non-linear systems are, however, at present (1985), few in 
number. Often these methods are not relevant for financial time series. 
New methods are still required and a number of innovations are suggested 
in the following chapters. 


2.11 SUMMARY 


Daily returns are characterized by firstly low autocorrelation, secondly 
approximately symmetric distributions having long tails and high kurtosis, 
and thirdly an unorthodox, non-linear, generating process. Stock returns 
have a positive mean butthis cannot be proved for futures returns. Models 
consistent with the preceding major characteristics of returns will be 
investigated throughout the remainder of this book. 

Monday returns have slightly higher variance than other returns and 
sometimes have a negative although small mean. Some returns may have 
marginally skewed distributions. These less important features of returns 
will usually be ignored when modelling the returns process. 


APPENDIX 2(A) AUTCORRELATION CAUSED BY DAY-OF-THE-WEEK 
EFFECTS 


Suppose that the distribution of returns depends on the day of the week 
according to the simple model: 


X, = Me + ое, Me = Pes; Or = 9;+5, 


with (&,) a stationary, zero mean, unit variance process. Autocorrelations 
calculated from data (x,, 1 < t < n) will be influenced by the daily effects 
ш, G;, 1 < į < 5. The numerical consequences, however, are very slight. 
Likewise, the consequences of month effects can be shown to be very 
small. 


Returns 


As n — c, the autocorrelation estimators R, x, defined for n random 
variables X, by > (X, — X)(X,,, — X)/X (X, — X)! converge with probability 
one to a limit. Sample estimates г, , „ will converge to the same limit. This 
limit will be denoted by т, x. It would equal p, x if (X,) was stationary. As 


п-> 5, 


n 5 
X= У xn > [> м)/5-». say, 
t=1 i=1 


i 


and 
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п-т 5 
У X,X,,,/n = [> xx.) /5. 
t=1 і-1 


Now assume (еу) is white noise. Then it can easily be shown that the daily 
effects u, and g, cause the (asymptotic) estimated autocorrelations 


5 5 | 
T, x = > (u, — gus. — ”/> (4, = ш)? + ai. (2.A.1) 
1-1 іп 


Clearly т, x = 7,45,x for all 7 > 0 and it can be deduced that z; х = тах and 
Tx = Tix. 

To illustrate the small magnitudes of the т, х, suppose д, and o; are 
defined by French's estimates for the Standard and Poor's index, see 
Section 2.5. Then (2.A.1) gives these values of т, x: 


T T 
1,4 —0.003 
2,3 — 0.005 

5 0.017 


Squared returns 


Let т, х: be the limit of estimators К, x:n- It is convenient to define 
y = ЕХ] = u? + o? and y = (y, + +  y/5. Then, as n о, 


n 5 

Y Хп > р нж) /s zum 
t=1 1=1 

апа 


5 
XXn р вх?) /s 


t=1 1=1 


Now assume { є, } is strict white noise with E[e;] = бапа Е[є{] = A. Then, it 
is easy to show that 


5 5 
пх = У (Yi yy, — »/3 (yi = у)? + 4ш?0? + (А — 1)o7. 
ісі 1-1 
(2.A.2) 


Again using French's estimates, (2.A.2) gives the following low auto- 
correlations firstly for the Gaussian case À = 3 and secondly for À = 6. 


T A=3 A=6 
1,4 —0.006 —0.002 
2,3 —0.003 —0.001 


5 0.017 0.007 


60 Modelling financial time series 


A second example is based on the average variance proportions for 
futures, presented in Section 2.5, with all и, = 0, as follows: 


T A=3 À = 6 
1,4 —0.002 —0.001 
2,3 —0.001 —0.000 

5 0.005 0.002 


APPENDIX 2(В) AUTOCORRELATIONS OF A SQUARED LINEAR 
PROCESS 


Suppose {X,} is linear with mean џи and (ғ,) is zero-mean, strict white 
noise, with 


Х-и = > Бе, |. 
і=0 


We assume ЕХ] is finite. Then, without further loss of generality, we can 
assume b, = 1and £[e?] = 1. Let À = F[e?] and define 
5, = (X, x ш)? - > biet, +2 > > bib & 6j, 
i <j 


and 
x= 2.2 
Su > bier. + 2 > > by Буе, 4.671: 
k k«l 


Then F[S,] = X b?. To find the autocorrelations of { 5,} we need Е(5,5,,,1. 
To get this expectation consider the general product of four innovations 
and its expected value: 


fu, j, k, = Бе, j£, £ kr - il. 


The function f will be zero unless (a) i = j= k — r = /— ror (b) there are 
two identical pairs of subscripts, e.g. i = j = k — т = | — r. Case (a) gives 
f = À, case (b) gives f = 1, and otherwise f = 0. Also when í < jand k < |I, 
f can only be non-zero ifj = k — randj = / — т. 

It can now be shown, by straightforward algebra, that for all 7 > 0, 


cov (S, S...) = EIS S, о] — ELS? 
2 
bb.) | 


= (A — 3) p bibi, + 45 


і-0 


Ш 
о 


Consequently, 


_ (à - 3 E bib, + 2X bib, 
Prs (А — 3) Z b! + 2(> bey 
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Now, firstly, X, has kurtosis к related to the kurtosis A of e, by 
(к — ЭКА — 3) = > BCD bi)? 

if A = 3, and secondly the autocorrelations of (X,) are 
Pr,x = > bibl», bi 

and, thirdly, the autocorrelations of the process 
Xt = У big, are а, = Y bibi,, >. bi. 

Hence our previous equation for p, ; can be simplified into 


(к — 3)a, + 2p x 
к-3+2 


Dp, s n 


This is equation (2.10.2). To establish bounds for the non-negative quan- 
tities о,, note that 


oc oc ж 2 oc 
Y Y nb, = 1 (5 в) -x ы) 
i=0 i=0 


ОРОТ 


Defining Ө by (var (Х,) — var (&))/var (Х,) and using 


and so 


У Б = 170-6), У Бі 21, 
і=0 і-0 


it follows that (2.10.3) is true, i.e. 


= 1 1 0 
<-4----,- 1; < ———.. 
3 s; EET. 


Chapter 3 


Modelling Price Volatility 


This chapter describes models for returns having either non-stationary 
variance or, conditional upon past observations, a variance dependent on 
such observations and additional variables. The models are non-linear, 
have high kurtosis for returns, and positive autocorrelation between 
squared returns. Thus they are consistent with the major characteristics of 
the returns process identified in Chapter 2. 


3.1 INTRODUCTION 


Variations in the level of activity at a market will cause changes in the 
variances of daily returns. Activity can be measured by many variables 
including trading volume and the amount of relevant new information. 
These variables change from day-to-day and so can the volatility of prices. 
Specific models have been described by Clark (1973), Tauchen and Pitts 
(1983), and others, summarized and extended in Section 3.5. 

At first, in Sections 3.2 to 3.6, we consider variance models for which the 
returns process is uncorrelated. Later, Section 3.7 extends the models to 
include autocorrelation among the returns and Sections 3.8to 3.10 present 
parameter estimates for stationary models. 

A stationary model must have constant variance but certain conditional 
variances can change. For a non-linear stationary process (X,j, var (X) is 
the same for all t but the conditional variance, var (Xx, i, X, ...), of X, 
given observations х, _1, X-2, etc. can depend on the observations. Simple 
examples clarify this in Section 3.2. 

Section 3.3 describes a general method for modelling changes in vari- 
ance or conditional variance based upon changes in market activity. 
Specific examples have a variety of variance structures: non-stationary 
variance (Section 3.4), changes in conditional variance caused by factors 
independent of prices (Section 3.5), and conditional variance changes 
caused explicitly by prices themselves (Section 3.6). 

Much of this chapter is devoted to stationary models for returns. These 
models can be very useful especially when simulating price behaviour. It is 
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not unreasonable to expect a stationary model to adequately approximate 
the true process for a relatively short period, say one year. Over longer 
periods, however, some non-stationarity might be expected. It is far from 
easy to determine the applicability of fitting stationary models to series 
covering several years. After fitting models in Sections 3.9 and 3.10 and 
assessing suitable forecasts in Chapter 4, it will be concluded in Section 4.4 
that stationary models can be appropriate for some of our long series but 
not for all of them. 

Some readers may prefer to skip substantial amounts of the mathemati- 
cal material in this chapter. This can be achieved by skipping Sections 3.2, 
3.6, 3.7, and 3.10. 


3.22 ELEMENTARY VARIANCE MODELS 


Changes in variance or a conditional variance can explain why there is far 
more autocorrelation between X? and X?,, than there is between X, and 
Хо, for low 7, {Х,} being the returns generating process. This is of course 
the non-linear characteristic of returns identified in Section 2.10. To 
motivate the conclusion about changes in variance, and to show why 
conditional variance can be relevant, we first consider very simple variance 
models. These models restrict the conditional variance of X, to be one of 
two possibilities, either oj or ø$ with a, # о and о), o> > 0. The models 
are too simple to describe returns adequately but they illustrate important 
features of non-linear fluctuating variance models. Realistic models per- 
mitting an infinite range of variances are introduced from Section 3.3 
onwards. 

Two ways that the variance can appear to change are now considered; 
{X} is non-stationary for one way yet stationary for the other. Initially the 


X, are given discrete distributions, subsequently continuous distributions 
are considered. 


Step change, discrete distributions 


Consider a variance change halfway through a series of n variables (n 
even), so that var (X) = a?for1 < t < inandvar (X) = о? іюгіп < t n. 
Suppose each X, is either a, ог-оу (t < in) oris either one of a; and —o; 
(t > $n). Also suppose the X, are independent random variables having 
P(X, > 0) = 5, all t; the notation P(A) denotes the probability of the 
bracketed event A. 

This process {X,} is non-stationary and hence non-linear because var (Х,) 
depends on the time t. It is uncorrelated because the X, are independent of 
each other, so the sample autocorrelations of the x, will be small. However, 
the series of squares х? is certain to consist of л consecutive observations 
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a; followed by in observations 02. Easy algebra shows that if o? = 
(0? + 03) the sample autocorrelations of the squares are 


RET 


Be = Y (xt — о?)(х?,, - 5 2 (х2 — o2)2 = (n — Зт)/п 
t-1 


for1 < + < 4nandany variances oí and o3. The variance change can cause 
a very high measured autocorrelation. 


Markov variances, discrete distributions 


Next consider a model in which the variance is stochastic, in a certain 
sense. It is necessary and important to distinguish between the variance of 
X, and some conditional variances. A process of conditional variances, 
denoted (V2), follows a Markov chain in this model. Each V, is either о; or 
о; with equal probabilities and V, = М.) with chance а, otherwise 
V, Z V, 4. Assume a Z з їо avoid trivial results. As before, suppose each X, 
is discrete and the only possible realized values for x, are оц, —o, (when 
V, = о) and о, -o (if V, = о»). This model can be described as follows: 


X, = WU, (3.2.1) 
(U,) is strict white noise, 

P(U, = 1) = P(U, = —1) = 2, (3.2.2) 

Р(М-о)- iid =o) =, alt, (3.2.3) 

P(V, = v| Vj = v, all j > 0) = a if v, = yy (3.2.4) 


lI 


1 — aif v, wa 


and {0%}, (V) are stochastically independent, i.e. the vectors U = 
(Ui, Uz, ..., U.) and V = (V,, №, ..., V.) are independent for all integers 
n. Notation of the form P(A| B)appears in (3.2.4) and denotes the probabil- 
ity of A conditional upon the event B being true. 

As {U,} and {V,} are strictly stationary processes, and they аге indepen- 
dent of each other, it follows that (X,) is strictly stationary. In particular, the 
X, have constant variance, 


var (X) = (01 + 0%) 
and they are clearly uncorrelated as 


cov (Xo, X.) = EIX, Xir] 
= ELV, VELU JEU, +] = 0 


because the U, are zero-mean strict white noise. Now if x;_, = оу, у, must 
be o, and so 


P(X, = o,|X, + = оу) = 3P(V, = eil Va = оу) = 2a 
# Р(Х, = о) = 5, asa Z 5 
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Therefore X,., and X, are not independent random variables. Con- 
sequently, when a # 2, the process (X,) is white noise but it is not strict 
white noise. And as the only linear white noise process is strict white noise, 
{X} is not a linear process. 

Although the X, have constant variance, it is important to realize that the 
variance of X, conditional upon the past observations /,_; = {X;-1, X» 5, ---} 
is not constant. As x7 , = V7_1, given |, , the distribution of X? = V+ 
becomes x?. , with chance a and g} + o — x2_, with chance 1 — а. Thus, 


var (X, |l...) = EIX?! l.i] = ax? + (1 — aoi + o$- x 4). 


This conditional variance depends on the previous observation x, +. The 
dependence of var (X,|/, 1) upon |, | contrasts with the result that any 
invertible linear process 


Х = s, + > 963, (сұ) strict white noise, 
і-1 
has constant conditional variance, var (X*|/*_,) = маг (e,), for all possible 
sets If = (xT 4, ХЖ), ---}. 
For a nearly equal to 1, the observed squares x? will contain long 
sequences of identical valies: for most times x? will equal x? |. This means 
the squares X? are highly autocorrelated. As X? = №, it is easy to prove 


E[X?] = (0? + оз), Е[Хі] = (с + о and 
EIX Xi] = «ЕУ V +l V, = Val 
+ (1 — а) М М | V, A Val 


= la(o1 + 02) + (1 — о)о%02 


and hence it can be shown that 


pix: = cov (OG, X3, var (X$) 
= {10а — 1)(01 — e? V (ito — o3?) 
= 2а — 1. (3.2.5) 


Likewise it can be proved that р, x: = (2a — 1)", т > 0. These autocorrela- 
tions are the same as those of an AR(1) process with autoregressive 
parameter 2a — 1. Equation (3.2.5) shows that if the x, have low sample 
autocorrelations but the squares x? have far higher autocorrelations then it 
is wrong to conclude that the variance is not constant. The correlation 
among the squares can be caused by changes in conditional variance. 


Step variances, continuous distributions 


So far the distribution of X, has been very simple and this has the advantage 
of ensuring easy results for the autocorrelations of the squares X$. More 
realistic and interesting results are obtained if, given a conditional variance 
№2, X, has the normal distribution N(0, v2). 
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Hsu (1977, 1979) has tested a single variance change model on US stock 
returns. Simplifying the model by supposing E[X,] = 0 gives: 


X, = o, U, 1<t=<m 
= aU, m+1st<n 


with {U,} a Gaussian, white noise process having U, ~ N(0,1). Hsu tested 
the null hypothesis o, = o» against the alternative о; = o», with m 
unknown, for 161 weekly returns on the Dow Jones Industrial Average 
from July 1971 to August 1974. Conclusive evidence of an increased 
variance from mid-March 1973 onwards was reported and attributed to a 
surge in news about the Watergate incident. Menzefricke (1981) and Hsu 
(1982) have given Bayesian analyses of the location of the unknown 
change-point m. 

The process {X,} is obviously non-stationary and uncorrelated, suppos- 
ing a, # o». The squared process (X?) does not have meaningful auto- 
correlations. However, sample autocorrelations calculated from squared 
returns should be positive for low ras the first m of the X? have a different 
expectation to the final n-m squares. The autocorrelations r, x:n for n 
squares will have an asymptotic limit if the ratio f = m/n is fixed as n > o. 
The limit is constant for all positive 7 and lengthy algebra shows it is the 
positive number 


т = (02 — a3) /((o3 — 03)? + 201 — f) 7104 + 28-10%). (3.2.6) 


Very approximately, Hsu's estimates are f = } and ø$ = 3o? and then ті 
1/1 = 0.09. 


Markov variances, continuous distributions 


The model just described has one change in the variance of X, and the X, 
have conditional normal distributions. Now suppose at the beginning of 
every day there is a chance (1 — а) of a variance change from one of oj or 
a to the other possible variance, described earlier by equations (3.2.3) and 
(3.2.4). These variance changes could reflect occasional changes in the 
amount of news about the goods, between little news and much news or 
vice versa. Supposing again that the conditional distribution of X, given its 
variance is normal with zero mean, X, is once more the product WU, with 
{U,} Gaussian, white noise. The process {X,} is stationary and uncorrelated 
with variance (тї + o). 

Given specific information about the conditional standard deviations 
v,.;, j > 0, the random variable X, has a conditional variance. In particular, 


var (X| v) = w 


and 


Modelling price volatility 67 


var (X| V1, мо...) aiu + 0— alo} 02— va) (327) 


for any v,_,, мз, etc. (Notation of the form А |Б;, b;, ... occurs in (3.2.7) 
and refers to the distribution of A given realizations bi, b}, ... of appro- 
priate random variables.) Unlike the discrete Markov model, it is impos- 
sible to describe the variance of X, conditional upon all past observations 
l-1 = {%_,, j> 1}. However, the variance given x,_, can be calculated. Let 
L(x|a) denote the likelihood of an observation x from the normal distribu- 
tion N(0, a?) and let À be the likelihood ratio L(x, .4|o:)/L(x,.1| о). Then it 
is possible to prove the result 


2 _ 2 2 _ 2 
T (Х|) = Alagi + (1 ei + ac + (1 a)i 


and this conditional variance is a function of x,_,, assuming о # > and 
G, É ©. 

To find the population autocorrelations of the stochastic process {X,’} it 
is convenient to work out E[X?], E[X2X2,,] (z > 0) and E[X/^]. The first two 
expectations are unchanged from the discrete Markov model but £[X7] 
becomes 3(a4 + 0$)2 as E[U1] is now 3. It can then be demonstrated that 

(a4 — 05)? 


< (1/5)(20 = 1), т>0. 


(2a — 1)? (3.2.8) 


Тһе continuous distribution of ће U,, compared with the earlier discrete 
distribution, causes smaller autocorrelation between the squared X,. If wis 
almost 1, the autocorrelations will decline slowly from some positive 
number less than 0.2. The function (3.2.8) has an ARMA(1,1) shape, i.e. 
there are ARMA(1,1) processes having the same autocorrelations as (X7). 

This model shows that occasional variance changes with conditional 
normal distributions can explain significant positive autocorrelation be- 
tween squared returns. The model also shows that variance changes can 
explain high kurtosis, for 


ky = E[Xš]E[X:] 
= 3(1 + (02 — оз) (02 o5) > 3. 


The unconditional distribution of X, isamixture of two normal distributions 
and the mixing causes the higher kurtosis. 


3.3 A GENERAL VARIANCE MODEL 


The return x, can be calculated at the end of the day t. This number can 
always be interpreted as being the realization of a random variable X,. 
Before day t begins the return x, cannot be known and so a probability 
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distribution for X, can be used to describe the possible outcomes x,. Then, 
by the end of the day, various people, companies, and governments will 
collectively have helped to determine a particular outcome x,, depending 
mainly on new information and interpretations thereof. 

In a similar way, suppose that by the end of day t the market has 
determined a conditional standard deviation v, so that, for fixed v,, x, is an 
observation from a distribution having variance v;. New information about 
the goods traded or other sources of economic and political information 
could partially determine v,, as also could the changing preferences of 
investors for different goods. During day t — 1it is reasonable to assume v, 
is not known exactly although good forecasts may be available. Con- 
sequently, v, can be viewed as the realized value of some random variable 
V,. Two important questions are then: firstly at what time does the market 
determine v, and secondly what sort of information and type of market 
behaviour determines v,? Several possible answers will be summarized 
later. 

Models consistent with the preceding discussion of a conditional stan- 
dard deviation have been proposed by several researchers, including 
Granger and Morgenstern (1970), Praetz (1972), Clark (1973), Epps and Epps 
(1976), Ali and Giaccotto (1982), Engle (1982), Tauchen and Pitts (1983), and 
the author. These models can all be written in the general form: 


X, = u + V, U, (3.3.1) 


with {U,} a standardized process, so F[U,] = 0 and var (U,) = 1 for all t, 
and (Vj) a process of positive random variables usually having 
var (X,|w). = vz; also F[X,] = и forall t. Nearly everyone has assumed the 
U, have normal distributions. Most people suppose the processes (М) and 
{ О. } are stochastically independent but this is not necessary as Engle (1982) 
has shown. Stochastic independence is sufficient but not necessary to 
ensure var (X,|w) = v+. For many markets it can be appropriate to assume 
u = 0 and u will always be very small compared with the v. 

Three classes of processes for (М) will be reviewed. Firstly, occasional 
changes in variance so that V, = V,,, with high probability (Section 3.4); 
secondly, changes in conditional variance caused by economic forces 
independent of the market (in the sense that {V,} and (UJ are stochasti- 
cally independent), perhaps with all processes stationary (Section 3.5); 
and, thirdly, changes in conditional variance caused by past returns so that 
V, depends only on /,., = (x.,, j > 0j (Section 3.6). All these processes for 
(V) will initially be described with the simplifying assumption that (0) is 
strict white noise (SWN). Later, in Section 3.7, modelling dependence 
between the U, will be introduced. 

Assuming {U,} to be SWN and making the reasonable assumption that 
Un; is independent of the vector (U,, М, Маз, ..., М) for all i > 0, we 


Modelling price volatility 69 


obtain the expected result that the X, are uncorrelated as, for 7 > 0, 


cov (X, Mar) = ELV, Vitr ОО.) = EIV Vi ОА, = 0. 


Notation 


We will often want to describe results for the squared returns after 
adjusting for the mean и. These are defined for random variables X, by 


5, = (X, — uy = М0. (3.3.2) 


When studying observed returns x, the mean џи is usually unknown but can 
be estimated by a sample mean x. In these circumstances sample adjusted 
squares are defined by s, = (x, — X}. Occasionally ш will be assumed 
known and the definition s, = (x, — и)? used, particularly in Section 3.6. 
Absolute returns will also be analysed, defined by 


M, = |X = u| = VU (3.3.3) 
for random variables and for observed data by т, = |х, — x|, ш unknown, 
or m, = |x, — u|, и known. 

We have now defined several stochastic processes in this chapter and 


there will be more later on. Table 3.1 summarizes the processes considered 
in Chapter 3. 


Table3.1 Processes considered in Chapter 3 


Process Definition References and notes 
A Ап autoregressive process See Section 3.6 
M M, = |X, = щі 
5 S, = (X, — ы) 
T T, = U, — г, SeeSection3.7 
U U, = (X, = н), E[U,] = 0,var (U) = 1 
V Generates conditional standard deviations — SeeSections3.3and3.7 
w Countsintra-day pricechanges See Section3.5 
X Generates returns E[X] = и 
£ Strictwhite noise E(e,] = 0, see Section 3.7 
7] Strict white noise Eln] = 0, see Section 3.5 


3.4 MODELLING VARIANCE JUMPS 


Extraordinary political events occur from time to time. Often the events are 
unexpected. We have already noted Hsu's evidence that revelations about 
the involvement of the US president in the Watergate affair increased the 
variance of US stocks in 1973. Another dramatic event was the Russian 
invasion of Afghanistan in 1979, causing turmoil in financial markets, 
especially the gold market. 
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If variance changes are solely caused by extraordinary events then there 
will only be a small probability of a variance change during any trading day. 
Most pairs of consecutive days (tand t + 1) will then have equal conditional 
standard deviations (v, = v,,,). Extraordinary events will usually increase v, 
and presumably after a period of time the event will be forgotten and the 
conditional standard deviation will fall. It seems sensible to regard {V,} as 
non-stationary as itis difficultto see how the consequences of very unusual 
events can be accommodated within a stationary model. The market does 
not cause extraordinary events and hence the variance jumps, so it is very 
reasonable to assume (V,) and (U,) are stochastically independent. Many 
events occur when markets are closed so it is fairly accurate to assume the 
realized value v, is determined before trading begins on day f. 

Variances have often been linked with trading volumes (e.g. Clark, 1973). 
A jump in variance therefore suggests a jump in trading volume as the 
number of traders and their average trading frequency changes. An un- 
expected, important event could cause a wider variety than usual of 
opinions about a fair price, and thus a higher trading volume. 

A time series of returns will contain an unknown number of jumps, if a 
jump model is appropriate. Hsu (1977, 1979, 1982) only considers one 
jump, Ali and Giaccotto (1982) suggest up to four jumps within an eleven- 
year period. Extraordinary events are by definition rare, so only a few 
jumps are likely in a series. Accurate estimates of v, can then be found for 
most times. The autocorrelations г, , of the adjusted squares s, should be 
positive; for in a total > (S, — S)(S,,, — S) the products (S, — S)(S,,, — S) 
have positive expectation for the vast majority of times that v, = V+. 
Equation (3.2.6) suggests the positive autocorrelation will be fairly small. 


3.5 MODELLING FREQUENT VARIANCE CHANGES NOT CAUSED BY 
PRICES 


General models 


Linking variance changes solely to extraordinary events is probably an 
unrealistic simplification of the process generating {М}. Many events 
could cause changes in conditional standard deviations with minor events 
causing smaller (yet still important) changes. The relative popularities of 
markets are always changing. So too is the amount of relevant new 
information each day, whether it be economical or political. Therefore we 
now consider models for which v, can be expected to change daily, 
although on most days the changes v, — v... will probably be small. Two 
types of model are considered. In this section we assume that the process 
determining the v, does not depend on prices so that 
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V, = (Vila Va, Voas ss т) (3.5.1) 


for some function f and a random innovation m, independent of all past 
returns x,_,, j > 0. The alternative possibility, that returns do cause the у, 
is introduced in Section 3.6. 

The easiest way to explain how market forces can make v, a function of 
events occurring on day t is to represent the return X, as the sum of 
intra-day price movements: 


W, 
X, — w= 5 о. (3.5.2) 
1=1 


In this equation, W, and every о, are random variables. Suppose the в, are 
independently and identically distributed as N(0, 73), and аге indepen- 
dent of W,. Then given an observed number w, of intra-day price move- 
ments the conditional standard deviation is simply v, = o, Vw. Returns 
will display non-linear behaviour if the W, are autocorrelated. 

Tauchen and Pitts (1983) describe an economic model for the reactions 
of individual traders to separate items of information. They assume u = 0 
and identify w, with the number of relevant information items during day t. 
Each news item causes a price movement а, and a trading volume 
vi ~ N(u,, 62). Conditional upon w, items of information, X, ~ N(u, o2 W,) 
and the day's trading volume is ~N(u,w,, с2и,). These conclusions show 
the distributions of returns and trading volume can depend on each other 
in a complicated fashion. Earlier papers, e.g. Clark (1973), restrict the 
conditional variance of X, to be a deterministic function of volume, 
corresponding to c, = 0. Tauchen and Pitts assumed {W,} is strict white 
noise. This must, however, be inappropriate, as then {X,} would also be 
strict white noise and we know this is not so. 

More generally, we can identify v, with a level of market activity covering: 
the amount and importance of new information, trading volume, the 
number of active traders, interest in the market relative to others and 
perhaps also seasonal factors. Many of these activity measures can only be 
assessed at the end of the day's business. Thus, in this type of model, v, is 
realized at the end of day t. 

As it is supposed that prices do not cause the conditional standard 
deviations, it is reasonable to assume (V,) and {U,} are stochastically 
independent. This can be proved true for model (3.5.2) with о, ~ М(0, a2). 
Let V, = c, VW, and 


_ Х-и 1 E 


V, c, VW, i=1 


Given any realization v,, w, is v2/a2, so E wp ~ N(0, у?) and U, ~ N(0, 1). 
As the conditional distribution of U, is the same for all v,, U, and V, are 


U, 


а). 
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independent. Likewise, the vectors U = (U,, U,,..., U,) and V = (V, Vy, 

.., V4) are independent for all n, establishing stochastic independence, 
providing all sets of variables а», are independent of V; i.e. individual 
intra-day price changes are stochastically independent of how many price 
changes there are. 

When (X, — м) is the product of stochastically independent processes 
{V,} and (U) we will call {X,} a product process. 

Note that in practice only one realized value is observed each trading 
day, namely x,, but that two innovations determine x,. One innovation (m, 
in equation (3.5.1)) will cause the level of market activity v, and the reaction 
of the price at this level of activity is summarized by an independent 
innovation U,. Given a series x,, 1 < t € n, it is impossible to recover the 
realized values v,. Nevertheless, it will be shown later that reasonable 
estimates of v, can be found. 


Stationary models 


The process (V;) could be stationary or non-stationary. Strictly stationary 
processes imply various results which are derived here and checked 
against empirical evidence in later sections. Іп due course it will be 
concluded that some returns series are non-stationary whilst others are 
satisfactorily modelled by stationary processes. 

Results for stationary ( V,) and (U,) can be obtained quickly by applying 
their assumed stochastic independence. It will also be assumed that both 
V, and U, have finite fourth moments. Let ky denote the kurtosis of the U,, 
so ky = E[UT] as E[U,] = 0 and E[U:] = 1, and let 6 denote the mean 
absolute deviation of the U, so 6 = E[|U,|]. For normal U,, ky = 3 and 
8 = VQ/m) = 0.798. 

The process {X,} is stationary with variance: 


EUX, — &*] = EIV} U$] = ЕМУ ЕШ] = E[V?]. 


This is a constant although, of course, the conditional variances var (Х | м) 
equal the time-dependent quantities v?. The kurtosis of the X, is given by 
k = EO н) _ МП _ КЕТИП (3.5.3) 
E[(X, — м1 ЕУ] ЕУ] 
Аз F[V?] > Е[У{]? for all positive variables V, having positive variance, 
ky > ku. The unconditional distribution of X, is a mixture of the conditional 
distributions + v,U,, mixing over all possible realizations v,. The result 
ky 7 ky confirms that mixing distributions having different variances 
increases the kurtosis. 
As {U,} is assumed at present to be white noise, (X) is also white noise. 
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However, the adjusted squares $, = (X, — и)? can be autocorrelated; 
EIS} = E[V?]andiif r > 0, 


E[S,5,..] = ЕГУ? ЫЛЕШ U a] = ЕУ У +1, 
so 
cov (5,, S) = ҒІ5,5,,1- ҒІ5І? = cov (V2, V?..). 


Consequently, the autocorrelations p, , of {S,} are related to the auto- 
correlations p, уг of (V?) by the linear relationship: 


p-s = p, v.lvar (V2)/var (S;)]. (3.5.4) 


The positive autocorrelations r... and r,, observed among observed 
squares x? and adjusted squares s, — (x, — x)*, in Section 2.10, is consistent 
with equation (3.5.4) if the process of conditional variances (V?) is posi- 
tively autocorrelated. Let a, = £[V/]. Then var (V?) = a, — аҳ, var (S,) = E[S?] 
- E[S,]? = kya, — aš апа so (3.5.4) implies 


0x D. sl D. v: < ШӨ 


for any distribution of the М,. 
The absolute returns M, = |X, — и | have mean E[M,] = ЕУ, |.|] = &E[V,]. 
Also E[(M?] = E[V?] and so it is easy to obtain var (Mj). For 7 > 0, 


EIM, Miss] = ELV Vis JEU U, Uis |] = 87 ELV, Viu, 
cov (М, М.) = E[M,M,.,] — ЕМ] = 87 cov (V, V.) 


and consequently the M, have autocorrelations 
рым = 8p, ylvar (V,)/var (Mj)]. (3.5.5) 


In terms of a, = E[V/], var (М) = a; — at, var (Mj) = а, — 67а? and thus 
var (Vj) < var (Mj) giving 


0 < p, up, y < 62. 


The lognormal, autoregressive model 


All the preceding results are as general as are possible. Constructive 
models for (V,) will have few parameters whilst permitting the mean and 
variance to be unconstrained and the feasible autocorrelations to cover a 
range of possible values consistent with the sample autocorrelations of |х, | 
and x. The models to be described have three parameters. It is assumed 
that {U,} is Gaussian white noise with U, — N(0, 1). 

The distribution of V, cannot have positive probability of a negative value 
so a normal distribution is inappropriate. Empirical distributions of sample 
variances have more extreme variances in the right tail than the left tail 
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(consider Table 2.5). A right-skewed, positive-valued, distribution is 
required. The lognormal family of distributions is the most convenient for 
obtaining straightforward mathematical results. It has also been found 
satisfactory, and better than alternatives, for empirical modelling by Clark 
(1973) and Tauchen and Pitts (1983). There are two parameters and we 
adopt the notational definition 


log (М) ~ No,B), | B» 0. 


The density function has already been given by equation (2.8.4) and it is 
well known that E[V,] = exp (a + 282). As Іор (Vf) = r log (Vj, all terms 
a, = E[V{] can be calculated and are exp (ra + 2r?8?). In particular, a,/ 
а? = exp (48?) and equation (3.5.2) shows the kurtosis of returns is 


ky = Зе? (3.5.6) 


which can be arbitrarily large. 
To find the constants in equations (3.5.4) and (3.5.5), the following 
variances can be derived: 


var (М) = e% (её — 4), маг (V2) = ее (e4P* — 1), 
2 2 > 
var (М) = +P (eP — 82), маг (S) = е“ (зе — 1). 


Consequently, as 8? = 2/7 for the standardized normal distribution, 

P- slp уг = (e^ — түзе” — 1) = А(8) (3.5.7) 
апа 

p. lp, v = {ef -1)(те” — 2) = В(В) (3.5.8) 


for any 7 > 0. The ratio A(8) increases monotonically with 8 from A(0) = 0 
to the upper bound 1/3 as B > œ. Similarly B(8)ismonotonicandbounded 
by 0 and 2/7. 

The autocorrelation coefficients and plots presented in Section 2.10 
show that any sensible stationary model must have p, ; and p, л; positive for 
several lags т, hence this must apply to p, y and p, v: also. The simplest 
possible model having positive autocorrelations at several lags is an AR(1) 
process for log (М): 


logg(V) — а = é(logg(V.)-o) +m, ф>0. (3.5.9) 


The innovation process (m,) is zero-mean, Gaussian white noise, stochasti- 
cally independent of {0%}; var (y) = B*(1 — ф?). A value of $ near to 1 
would mean the conditional standard deviations change gradually. 
Granger and Newbold (1976) prove that the autocorrelations of the 
Gaussian process (log (V,)) and the non-Gaussian (V,) are related by 


p.v = {exp (8?р, кеу) — 1}/{ехр (82) - 1) 
- {Pr togv * 3B? p? ову + {1 + ig? t e) 
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To obtain p, уг, replace В? by 48? in the right-hand side of these equations. 
AS р, огу is ó "for the AR(1) process, the autocorrelations of the observable 
processes {5,} and (M,} are 


A(B) {exp (482ф°) — 1V/(exp (48?) - 1) (3.5.10) 


[^ 


and 
p. = В(В) (exp (8*6? — 1}/{exp (8?) — 1). (3.5.11) 


Each of these autocorrelation functions steadily decrease, beginning from 
a number less than 1 and declining towards 0 as 7 increases. The functions 
are very similar to the autocorrelations of an ARMA(1,1) process, namely 
Са" for constants C and a (equation (1.9.13)), with 1 > С> 0 and a > 0. 
Consequently, the autocorrelation functions given by equations (3.5.10) 
and (3.5.11) are reasonably consistent with the plots dispayed in Section 
2.10. 

The lognormal, autoregressive model has four parameters, а, B, $, and 
p. These are estimated from the time series of observed returns in Sections 
3.8 and 3.9. It is not possible to find the conditional variance given all past 
observations, i.e. var (X,|X, 4, Xj», ...), because the component series 
{v,} and (uj) are not observed. If, somehow, the past v, can be discovered 
then 


var (X| V-14; ...) = var (X; | v1) = EIV? | v] 
= 2% exp (2(1 — фа + 28021 — $?)) (3.5.12) 


but in practice this variance cannot be calculated; an estimate of v, , could 
always be substituted and used to help estimate quantity (3.5.12). 


3.6 MODELLING FREQUENT VARIANCE CHANGES CAUSED BY PAST 
PRICES 


General concepts 


In the previous section it was supposed that the process determining the 
conditional standard deviations does not depend on past returns or prices. 
Now an extreme alternative to this assumption is considered, namely that 
conditional standard deviations are a deterministic function of past 
returns. Thus, 


V, = ҚА, X-2, ...) (3.6.1) 
for some function f; a very simple example is 
БХ, 1) = (ag + о1(Х,1 — ылу” 


with оо and a, both positive. We assume throughout this section that 
U, = (X, — ш)! V, is normal distributed and independent of all U,_, and all 
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М, í > 0. The X, are then uncorrelated. Also, given past returns l; = 
{%—1, X22; --.), v, can be calculated and then X,|/,_, ~ N(u, v2). 

This approach to modelling changes in conditional variances is due to 
Engle (1982). This section is based on his paper and provides further 
models and mathematical results. Engle's work is directed at general 
econometric models possessing constant variance yet changing con- 
ditional variance. Its intended value for modelling financial time series is 
unclear (Engle, 1982, p. 989). However, the ideas are certainly important as 
they offer potential models for changing conditional variances and these 
models can and will be assessed using observed data. 

There are notable differences between models for X, based upon con- 
ditional variances firstly caused by H,_, = (X,_,, Х-;,...), as in this sec- 
tion, and secondly caused by J, = (m, Vi, V2, ...), as in Section 3.5. In 
the first case, only one innovation per unit time determines X, (namely U,) 
but, in the second, two innovations are required (namely U, and y). When 
it is correct to model variances using Н, |, the realized value v, can be 
observed at the end of day t — 1. However, if the true process uses J, then 
v, is determined later, at the end of day t, and the realized value cannot be 
observed. Another comparison is based on the relationship between the 
processes (М) and (U,). Variance models based on H,_, do not have 
stochastic independence between the two processes (a proof follows 
later), but it has already been argued that stochastic independence is a 
reasonable assumption when /, determines the conditional variance. 

Models based upon (3.6.1), with X,|/,., ~ №и, vz), have the very desir- 
able property that any parameters in the function f can be estimated by 
maximizing the likelihood of observed data хі, x,, ..., x,. Denoting the 
density function of №и, v?) by g (x|v,), the likelihood of the data is simply 
the product gov) (х | v) ... g(x,1v,). Maximum likelihood estimates 
are given in Section 3.10. Their availability is a considerable advantage over 
the models given in Section 3.5. However, the models presented in this 
section have very complicated unconditional distributions and it is difficult 
to establish conditions for stationarity and then to find the moments of 
the X. 

By Section 4.4 we will have described enough results to compare the 
models of this section with those introduced in the previous section: it will 
then be concluded that the product processes of Section 3.5 are the better 
models. 


Caused by past squared returns 
Engle defines a zero-mean, ARCH(p) process, {X,}, by 


p 1/2 
X, = E + У, ах) U, (3.6.2) 


f=1 
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there being p + 1 non-negative parameters а;, with a > 0 and (U,) 
Gaussian white noise, with U, — N(0,1). The acronym ARCH refers to 
autoregressive conditional heteroscedasticity. A logical extension of (3.6.2) 
is given by replacing X, by X, — u for all times t, to give ап ARCH(p) process 
having constant mean E[X,] = и. 

Consider the process when pis 1. Then, 


X,— p = {œ + «(Ху — uy )"?u, 
and 
var (X,|x_,) = ag + @ (X1 — p. (3.6.3) 


Large deviations of x,_, from the mean и then cause a large variance for the 
next day. The process is stationary with finite variance if, and only if, a, < 1. 
The unconditional variance is then ag/(1 — o4). The fourth momentis finite 
if 3a? < 1andthen the kurtosis is ky = 3(1 — a2)/(1 — 3a%), which exceeds 
3 for all о; > 0. For the non-trivial case a, > 0, there will always be some 
integer j for which E[X?U* ] is infinite for all integers k > 0 and so these 
moments do not exist (Engle, 1982, p. 992). The series of squares 
5, = (X, — м)” and realized values s, = (x, — и)? satisfy 


E[S,| s] = ay + 045,4, 


using (3.6.3). Now, a stationary AR(1) process defined by A, = oç + a4 A, , + 
U, has А а ,] = о, + оңа, апа autocorrelations a7 (Section 1.9). This 
suggests (5,) also has autocorrelations от, providing the variance of S, 
exists, i.e. За? < 1. This result is proved at the end of the chapter, in 
Appendix 3(A). 

Fora general ARCH(p) process, (X,), define an associated AR(p) process 
by 


p 
A — > GA, _; + U,. 


The same innovation U, is used to construct both X, and A,. Then, (Х.) is 
stationary if, and only if, (A,) is stationary and then 


var (Х,} = ay/(1 — (о) + ay + --- + ор), 


see Engle (1982, Theorem 2). Assuming stationarity, let z£ = var (A). Then 
it is shown in Appendix 3(A) that if the kurtosis exists it will equal 
3/3 — 202); it appears the kurtosis exists if a4 < 1.5. It is also shown in the 
appendix that (5,) апа {A,} have the same autocorrelations, SO p, s = р, д 
for all 7, whenever the fourth moment of X, is finite. This result provides a 
constructive method for finding the autocorrelations of the squares of a 
general ARCH process. It is not possible to derive the autocorrelations of 
M, = |X, — ш. 


78 Modelling financial time series 


As V; = var (X,|X,_,, ...) depends on (X,., - ш) = V j|U? ;,1«ip,it 
follows that (V,) and (U,) аге not stochastically independent. To prove 
this, let j be the least positive integer for which o; > 0. Then E[V;U7_,;] – 
ЕТУ ЕГШ? ;] can be shown to equal 2a;E[V7]# 0, disproving stochastic 
independence because V, and U,_; are not independent random variables. 


Caused by past absolute returns 


The conditional variances of X, need not be a simple linear function of past 
squared returns. Another plausible formulation is 


p 
X = р = Ë s > a;l X-i — «du (3.6.4) 
1-1 


with a > Oandalla, > 0. The mathematical properties of (3.6.4) are largely 
unknown. However, writing M, = |X, = u|, m, = |x, — u|andé = ЕДІ, 
E[M, | m.., ..., mi] is ag + 8 > o;m,.,. This suggests that {M,} and its 
associated process A, = ó X a;A,_; + U, have the same autocorrelations, 
when they exist. It is proved that p, м = p, Ain Appendix 3(A). The relevance 
of a formulation like (3.6.4), compared with (3.6.2), can be determined by 
calculating their respective likelihoods from observed data. 


ARMACH models 


An ARCH process (X,) constructed from strict white noise { U,) will always 
be uncorrelated whilst the derived processes {M,} and {5,} will have 
non-zero autocorrelations. These theoretical autocorrelations must be 
similar to observed autocorrelations if the process is to have any chance of 
being a satisfactory model for financial returns. We have just seen that 
either the р, < or the p, m are the autocorrelations of an AR(p) process when 
{X,} follows (3.6.2) or (3.6.4). The sample coefficients г, x2, r, |, and г, 
described in Section 2.10 are almost always positive and generally decay 
slowly as т increases, but all the coefficients are fairly small. An AR(p) 
process cannot produce this behaviour for low p. However, the auto- 
correlations of a stationary ARMA(1,1) process 


A: — A1 = U, — 0U, |} <1, (3.6.5) 


are p,a = Có", with C = (1 — 66)(6 — 0) {p01 — 299 + 02)). 

This follows from equation (1.9.13) with a replaced by ф and b by -0. 
Whenever 1 > C, ó > 0, it is possible to find a positive 0 such that 
p-a = Có' and then ф > Ө. Equation (3.6.5) can be rewritten as an AR(%) 
process, cf. equation (1.9.11): 


ос 


A, EE > oi + U,, а; = (ó UV 9)0'' (i = 1). 


i=1 
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Hence a plausible class of ARCH processes for returns, based on (3.6.2), is 


© 


1/2 
X — w= Ë + (ç — 0) У OMX; — 2 U, (3.6.6) 
1-1 

for some ф and Ө constrained by 1 > $ > 0 > 0. The constraint on ф 
ensures (X,) is stationary. It can be shown that X, has finite kurtosis if 
$? + 2(0— Ф) «1. Тһе kurtosis then equals  3(1-— $?y 
(1 — 39? + 40 — 20?) and p, s exists and equals Сф”. More algebra shows 
that the conditional variances V? аге a weighted combination of И? |, 
(Ху — н)? and gè = var (X) = ay/(1 — (a, + a + +: :)): 


V? = 0V2 . + (ó — X, u? + (1 — dioi. (3.6.7) 


To find a plausible class of ARCH processes based upon (3.6.4), substitute 
бе, = (ó — 0)0 ^! so: 


PE Ë + (é$ = 0) ¥ 0”1|X. ; — «infu. (3.6.8) 
i=1 
The conditional standard deviation V, is then a weighted combination of 
Vi, [X4 — wl and wy = ЕХ, — u|] = 6og/(1 — бао + о + .. Ур 
V, = Өү, + (6 — O)|X_, — д|/8 + (1 — b) py /6. (3.6.9) 


Assuming {X,} is stationary, p, м = Сф”. 

The models defined by (3.6.6) and (3.6.8) are constructed from an 
associated ARMA process and so it is natural to then call {X,} an ARMACH 
model. There are four parameters: the mean д, a scale parameter (oy OF pay) 
and the АКМА parameters ф and Ө. 


3.7 MODELLING AUTOCORRELATION AND VARIANCE CHANGES 


All the models described in the previous sections of this chapter share two 
properties. Firstly, the realized conditional standard deviations v, are not 
the same for all days t. Secondly, the standardized returns U,, defined as 
the returns minus their mean and divided by their conditional standard 
deviations, form a strict white noise process. The first property causes 
autocorrelation among the squares S, = (X, — ш)? and absolute deviations 
M, = |X, — u|, whilst the second ensures the X, are uncorrelated. 

There can, however, be low yet statistically significant correlation be- 
tween returns on different days, a conclusion asserted by various re- 
searchers and confirmed by tests in Chapter 6. To model any such auto- 
correlation, the second property must be altered. Recall the general 
factorization of X, — д: 


Х-и = V, U, (3.7.1) 
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with several conditions on (U,) inall the models described so far, including 
these four: (a) the U, have independent and identical distributions, (b) 
E[U,] = 0, (c) var (U,) = 1, and (d) U, and V, are independent. Stronger 
conditions replace (d) in Sections 3.5 and 3.6; by (d’) {U,} and {V,} аге 
stochastically independent in the former section and, in the latter section, 
(d") (U,, Ui, ..., U)isindependentof(V,, №, 1, ..., V, j),forall i, у = 0 
and t. To obtain non-zero correlation between X, and X,+, for some positive 
lag 7, it is obviously necessary to change condition (a). It is convenient to 
retain the factorization (3.7.1), conditions (b), (c), and, whenever possible, 
(d). 

Condition (a) must be changed to allow {U,} to be autocorrelated, yet 
{U,} ought not to be very different from a strict white noise process. One 
possibility is a linear process defined by a linear combination of variables 
from a zero-mean, strict white noise process (ғ,), thus 


(а*): U, = > Ge; (3.7.2) 
1-0 

for constants 0; (perhaps all zero for і greater than some integer q), with 
0, = 1. We should then expect 6,, 65, ... to be small numbers as we have 
already seen in Chapter 2 that there is very little, if any, autocorrelation 
among returns. In (3.7.2), let T, = U, — &. Then (U,) is almost strict white 
noise because {U,} is the sum of the strict white noise process (e,) and the 
autocorrelated process (T,) and the variance of T, is much less than the 
variance of e,. In fact var (T,)/var (sj) is the small number X 02, summing 
over positive /. 

Note that (a*) implies X, = u + Vie, + V,T,. The term Ve, is unpredict- 
able, whereas the product V,T, is the reaction on day tto information first 
known on previous days t — i, i > 0. High market activity, represented by 
a high realization v,, will then imply a greater quantity of slowly interpreted 
information, represented by v, T,, than occurs during periods of low market 
activity. This does not appear unreasonable. 

More generally than (a*), condition (a) can be replaced by 


(a**): {U,} is almost strict white noise. (3.7.3) 


Statement (a**) means there is a decomposition U, = e, + T, with {e,} 
strict white noise and var (T,)/var (€,) a small number for all t. 

The revised conditions about ( U,) enable modelling of an autocorrelated 
process (X,), having stochastic conditional standard deviation, without 
altering the interpretation of v,. The realization v, remains a measure of 
market activity on day t. There are doubtless other ways to model a process 
characterized by weak autocorrelation and changes in conditional standard 
deviation. The method outlined in this section appears to be satisfactory. 
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We retain the relationship X, — и = V,U,. Given a realization v, of V, the 
conditional variance var (X,|v,) will be exactly v? for some processes and 
very close to v? for others. We will always say when var (X, |м) is not exactly 
v2: this occurs once in this section and also in Section 7.2 and is а 
consequence of v, telling us something about the conditional distribution 
of U,. Because var (X,|v,) is always very close to v}, we continue to call v, the 
conditional standard deviation although on occasions it is just a very good 
approximation to the exact conditional standard deviation. 

For stationary processes, the autocorrelations of (X,) and (U,) are not 
identical although they can be very similar. The results presented in earlier 
sections about the autocorrelations of (S,) and {M,} are no longer exact 
when {U,} is autocorrelated but they are excellent approximations. We 
now consider the consequences of autocorrelated U,, separately for each 
of the variance models introduced in Sections 3.5 and 3.6. 


Variances not caused by returns 


For {V,} defined by a functional relationship V, = f(V,_,, М-2,..., т) and 
{U,} strict white noise, it was shown in Section 3.5 that it is possible to 
justify the assumption that (V,) and {U,} аге stochastically independent. It 
is reasonable to retain this assumption when { U;} isalmost, but not exactly, 
strict white noise. Then var (X,|v,) is v2. 

Suppose both {V,} and {U,} are strictly stationary processes and, as 
before, all fourth order moments are finite. Applying the assumption of 
stochastic independence, var (X,) — F[V?] and 


EIX: — Ш) (Хе, — д)] = F[V,V,.;]E[U,U,,.]. 
As U, has unit variance, E[U,U,,,] equals the autocorrelation p, u and so 
рх = P, uEIV М.Е. (3.7.4) 
Also 
EIV: Val = EIVI? + р, var (V) 
for all 7, including 7 = 0, and thus it can be deduced from (3.7.4) that 
p. у < p, xfp, u < 1. (3.7.5) 


It follows that p, x and p, are very similar whenever the V, are highly and 
positively autocorrelated. 

In Section 3.5 the autocorrelations р,5 and р, м were shown to be 
constants times р, у: and p, v respectively (equations (3.5.4) and (3.5.5)), 
assuming {U,} was strict white noise. When the U, are autocorrelated, 
E[S,] = E[V?] and 
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EISS] = ЕТУ? VL ]ELUTUT..]. 
Let k = E[U1] so that var (U?) = k — 1. Then, 
ELU2U?,.] = E[U2 + cov (02 72,,) = 1 + (k — 1)p, us. 


Using these equations it can be deduced that cov (S,, 5;,,) divided by 
var (5j) is 


рө = р, у: [var (Vi)/var (5)] + cpu» (3.7.6) 


with c, = (k — 1)E(V2V2,, ]/var (S). As 0 < E[V2V2,,] < ЕГУ and var (S) 
> (k — 1)E[V%] it follows that 0 < c, < 1. The upper bound can be attained 
if every V, equals the same constant with certainty. 

The additional term c, p, uz in equation (3.7.6) is negligible when series of 
financial returns are modelled. For example, if {U,} is Gaussian then 
р.и» = р? u (Granger and Newbold, 1976) which will be very small com- 
pared with p, v: and p, s. Empirical estimates will confirm this conclusion. 

Using similar methods applied to {M,} it can be shown that 


Pim = 82р, v [var (V,)/var (Mj] + а, р, (3.7.7) 


with 0 < d, = (1 — &?)E[V, V; l/var (M) < 1. The extra term due to 
autocorrelations among the U, is again negligible if {U,} is almost strict 
white noise. 


Variances caused by returns 


In Section 3.6, X, — ш was modelled by the product of a strict white noise 
variable U, and a conditional standard deviation determined by past 
returns V, = f(X, 4, Х-2,...). The process (Uj) сап be redefined as almost 
strict white noise in several ways. Unfortunately, none of them provide a 
complete set of mathematical results. The most convenient way to allow 
the X, and U, to be autocorrelated is to suppose 


> 604. ш) = Ме or Х-и = Y, 6 V-i&-i 
і50 i=0 


for constants ¢;, 6 (фо = 0 = 1), (&) zero-mean strict white noise, and 
V, = f(%_1, X-2, ...) as before. The moving-average representation can 
be rearranged as 


X, — w= V,U, U, = & + Y 6Gs(V.V. (3.7.8) 
i=1 
Then (0) is almost strict white noise assuming, as we do, that 0? is small, 


summing over positive i. We retain the assumption that U, has unit variance 
and then e, has variance slightly less than one. Given a past history 
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h- = {x alli > 0), every product inside the summation term of (3.7.8) 
can be calculated and consequently var (X,|/,.,) = var (X,|w) = v; var (Er). 
This means v, is now a constant times the conditional standard deviation. 

The autocorrelations of (X,) can be found by defining e; = e, V, and 
noting that {є, } is white noise with X, — и = = 6e€;-;, so that 


Dp. x = > вв. 5 07. (3.7.9) 
i=0 i=0 


When the function f is sufficiently symmetric in past innovations e to make 
ты = & V/V,,; and т.,) = Ettr V, / Мата) uncorrelated for all i, j, + > 0, 
the autocorrelations of {U,} are 


ри = У, 00... Eleti V? iAV, Va). (3.7.10) 
і-0 

Тһе expectation іп (3.7.10) cannot Бе simplified for realistic functions f. 

Nevertheless, it is seen that p, x and p, u will be similar if the V, are highly 

and positively autocorrelated. 

It is impossible to calculate the autocorrelations of either {S,} or (Mj) 
when the U, are autocorrelated, even for elementary ARCH specifications. 
However, these autocorrelations are almost the same as those of the 
uncorrelated process obtained by setting 0; = 0 for all i > 0. For example, 
if v? is a linear function of p terms (x,_; = м), 


p 
EIS |h] = [% + >; ais. JELU? ha): 
i=1 


As {U,} is almost strict white noise and var (U,) = 1, it follows that 
E(U |1,1] is always close to 1 and hence the constants 6; are unimportant in 
the calculation of the autocorrelations p, s. 


3.8 PARAMETER ESTIMATION FOR VARIANCE MODELS 


It will be assumed in the rest of this chapter that all stochastic processes are 
stationary. Models can then be fitted and their parameters estimated. The 
stationarity assumption will be assessed primarily in Chapter 4, by compar- 
ing forecasts based on stationary models with forecasts expected to do well 
for non-stationary processes. 

A realistic stationary model for the conditional standard deviations (V,) 
will have at least three parameters, two to specify the mean and variance of 
V, and at least one to model the autocorrelations. Parameters can be 
estimated by matching theoretical and sample moments. Let uv = ELV,], 
оф = var (V) and ë = E{|U,|]. Then, by making our usual assumption that 
V, and U, are independent, 
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E(M,] = E[V,]JE[|U,|] = дё (3.8.1) 
and E[S,] = ЕМЕ] = wy + oç. 
From data (хі, х, ..., Xa}, the sample averages 
х-15 x, SUS ects and s = Í >; (x, — x) 
n n п 
сап Бе calculated and hence the estimates 
By = т/б, of = 5 – (т/б)? (3.8.2) 


are provided by using m and S to estimate E[M,] and E[S,] respectively. It 
should be noted that estimates for the process { V,} require the distribution 
of U, to be known or assumed. In particular, ó is needed in the equations 
(3.8.2). Applying our usual assumption that U, — N(0,1), ó is approximately 
0.798. The estimates of uy and ay are given in Table 3.2, for all the forty time 
series. 

Many series have low values for the ratio Av/óv showing that V, does not 
have a normal distribution. For example, if uy/ay = 1.65 and V, is normal 
then 5 per cent of the observations v, are expected to be negative and this 
is unacceptable as V, models a positive quantity. Clearly V, should have a 
right-skewed distribution. 

To estimate an autocorrelation model for (V,) more assumptions are 
needed. We consider the product and ARCH processes separately. 


3.9 PARAMETER ESTIMATES FOR PRODUCT PROCESSES 


Suppose, as in Section 3.5, that (X,) is a product process and log (М) ~ 
N(a, 82). The parameters а and 8 cannot be estimated by the method of 
maximum likelihood because the multivariate density of Xj, X, ..., X, 
cannot be evaluated. Simpler methods are necessary; the methods 
described are the only ones at present available. As E[V,] = exp (о + 48°) 
and E[V7] = exp Qa + 282), 


В? = log (ELVIVELV,) = log (и  opue)- 


This suggests estimating 8? by substituting йу and ôy for uy and ov. Then, 
from (3.8.2), 


Ё? = log (6?s/m?). (3.9.1) 


Also, = = log (F[W]*/VE[V:]) and again replacing шу and oy by their 
estimates gives 


& = log (т7(62/5)). (3.9.2) 


The estimates ё and ñ are presented in Table 3.2. 
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Table3.2 Parameter estimates for variance models 


[m бү & B 

Spotseries 

Allied 0.0167 0.0080 —4.197 0.454 
Alcoa 0.0158 0.0076 —4.249 0.457 
Am. Can 0.0117 0.0054 —4.539 0.438 
ATandT 0.0092 0.0041 —4.778 0.425 
Am. Brands 0.0121 0.0058 —4.518 0.454 
Anaconda 0.0201 0.0122 —4.064 0.560 
Bethlehem 0.0157 0.0067 —4.236 0.407 
Chrysler 0.0221 0.0105 —3.917 0.453 
Dupont 0.0125 0.0050 —4.456 0.383 
Kodak 0.0145 0.0059 —4.309 0.393 
C. Electric 0.0141 0.0061 —4.344 0.413 
G. Food 0.0139 0.0062 -4.371 0.430 
G. Motors 0.0126 0.0059 —4.472 0.445 
G. Telephone 0.0159 0.0061 -4.211 0.371 
Harvester 0.0150 0.0067 —4.287 0.424 
FT30 0.0128 0.0019 —4.371 0.147 
Gold 0.0157 0.0098 —4.320 0.575 
Silver 0.0158 0.0104 —4.328 0.599 
Copper 0.0158 0.0097 -4.311 0.566 
Lead 0.0171 0.0109 —4.239 0.582 
Tin 0.0105 0.0073 —4.755 0.629 
Zinc 0.0158 0.0129 —4.400 0.713 
£/$ 0.0050 0.0032 —5.479 0.595 
Futures series 

Corn (12) 0.0108 0.0072 —4.706 0.604 
Corn (6) 0.0101 0.0068 —4.787 0.614 
Corn (3) 0.0101 0.0068 — 4.780 0.614 
Cocoa (12) 0.0168 0.0087 —4.208 0.490 
Cocoa (6) 0.0177 0.0082 -4.132 0.440 
Cocoa (3) 0.0183 0.0087 —4.101 0.452 
Coffee (12) 0.0170 0.0148 —4.356 0.751 
Coffee (6) 0.0163 0.0137 —4.385 0.733 
Coffee (3) 0.0165 0.0143 —4.383 0.748 
Sugar (12) 0.0197 0.0126 — 4.099 0.586 
Sugar (6) 0.0192 0.0117 —4.113 0.561 
Sugar (3) 0.0195 0.0121 —4.101 0.571 
Wool (12) 0.0092 0.0095 —5.049 0.851 
£/$ (6) 0.0052 0.0030 -5.398 0.530 
ОМ (6) 0.0054 0.0029 -5.322 0.454 
SF/$ (6) 0.0068 0.0027 -5.072 0.390 
T-bond 0.0100 0.0016 —4.621 0.160 
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Any two sample moments can be used to estimate a and B. The selection 
of m and s may appear arbitrary but it has the advantages of analytic 
convenience and avoiding high order moments. An alternative would be to 
calculate the second and fourth moments, then estimate 8 using the 
sample kurtosis and equation (3.5.6). Afterwards, using the estimate of B, 
а can be estimated by matching theoretical and sample second moments. 
However, kurtosis estimates are relatively inaccurate so it is preferable to 
use lower order moments. The recommended estimates à, B are non-linear 
functions of unbiased estimates m, s and hence à and ñ could be biased, 
although any bias will be small for large samples. 

Given & and Й, the distribution of V, is estimated to have median value 
exp (à) with 95 рег cent of the observed v, predicted to be between 
exp (& — 1.968) and exp (à + 1.96). For the longest series, the sugar 
futures returns (12 months used per contract), à = —4.099, ñ = 0.586 and 
the 95 per cent interval is from 0.0053 to 0.0523 with median at 0.0166. 


Lognormal AR(1) 


The autocorrelations of the process {log (V,)} will depend on certain 
parameters. In Section 3.5 it was claimed that the simplest and most 
appropriate stationary model for the logarithms of the V, is the AR(1) 
process, which has autocorrelations ф *. We will now consider methods for 
estimating ф. At the present time it does not seem helpful or necessary to 
investigate processes with more parameters than the AR(1). However, the 
methods illustrated for AR(1) could be applied to higher-order auto- 
regressive processes, although much more computer time would be 
required. 

A set of sample autocorrelations r, ,, г, „ for several т, say lags 1 to K, 
provides information useful for estimating ф. From (3.7.6) and (3.7.7) it can 
be seen that the autocorrelations of {U7} and {|U,|} make negligible 
contributions to p, s and p, м. It will therefore be assumed that {U,} is strict 
white noise when estimating ф. Then 


p.s = A(B)p,v. and p. = В(В)р,у (3.9.3) 


with А(8) and B(B) defined by (3.5.7) and (3.5.8). All the autocorrelations in 
(3.9.3) are functions of 8 and ф. This is emphasized by using the notation 
рту? = p(t, V^, B, $) and Prv = p(T, V, B, Ф). 

Three methods for estimating ó have been evaluated. All of them seek an 
accurate match between sample and theoretical autocorrelations. Sample 
autocorrelations are calculated as 


rl, = n(n — kr) УФ — 8s... — 5) / Y (8, = 5)? (3.9.4) 
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and r; m is defined similarly, k being either 1 for a spot series or the number 
of contracts used for a futures series. The factor n/(n — kr) is included to 
reduce bias in the estimates; remember, from Section 2.9, that the 
numerator summation in (3.9.4) excludes those times tfor which time t + 7 
refers to a different contract. The discrepancies between sample and 
theoretical autocorrelations will be measured by statistics based on 


n 


K 
т= 


K 
[ris - ps? and n Y [rim Pml? 
1 т=1 
for some number of lags K. The number of returns n is included in order to 
simplify comparisons between series, since the expected magnitude of a 
term like (r7, — р, s)2 is the variance of an autocorrelation coefficient and 
so proportional to 1/n when ф is estimated perfectly. 
The methods are now described. It will be argued from the empirical 
results that the third method is best. 


Method 1 
The first method substitutes the estimate |, given by (3.9.1), for 8 and then 
chooses ¢ to minimize either 


K 
Еф) = n X lr, - А рт, V^, B, Ф)? 
т=1 
or 
K ^ ^ 
Ғ,(ф) =n > [r; m ы В(В)р(т, V, B, ФУР. 


Тһе optimizations сап be performed Бу calculating these objective func- 
tions for various ó. | have calculated F, and Р, for ф = 0,0.01,0.02, ...,0.5, 
0.505, 0.51, ..., 0.9, 0.901, 0.902, ..., 0.995. In all the calculations Kwas set 
at 50. 


Method 2 

A second method is intended to avoid problems when f is a poor estimate 
and so the estimated constants А(8) and В(@) are inappropriate. Both Band 
ф are estimated, by minimizing 


P(B, $) = > [rs = А(В)р(т, V2, В, p) 


or 


F,(8, 6) = пУ [rim – В(В)р(т, V, B, ФР. 


These optimizations were performed using an iterative technique which it 
is not necessary to describe. It suffices to note that considerable computer 
time was needed to minimize either F; or F4. 
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Method 3 
The optimizations required by the second method can be simplified by 
noting the approximations 


p(t, V B, $) = plt, V, B, $) = ф" 


discussed earlier in Section 3.5. The approximations are best for small 
values of B. A third method merely minimizes 


F(A’, ф) = nM Iri, - АФФ 
or 

F.(B', $) = n Y [rim — BOY. 
These minimizations are straightforward. For example, the best A' when $ 
equals dy is, using calculus, X r} 64/Z фо". The possible values considered 
for ó were identical to those considered for the first method. 


Given the estimates А and ¢ minimizing Ғ;, another estimate of В, 
denoted by ĝ,, is given by solving A(B,) = А; in fact 


B, = 0.5(log (1 — А)/(1 — 3A)]}". 


Likewise, if É and é minimize F,, an implied estimate &, solves B(B,,) = В 
to give 


B, = {log [201 — By — тӘ). (3.9.5) 


Results 


The second and third methods give almost identical estimates of ф. On 
average, the absolute difference between the two estimates derived from 
the same set of sample autocorrelations is less than 0.001. Consequently, 
no further results are given for the second method. 

When the minimum values of F, are compared with the minimum values 
of F; it is found that the minima for the third method are less than those for 
the first method, often considerably so. The same conclusion holds when 
the minima of F, and F, are compared. Of course, the third method 
minimizes over an additional parameter but the improvement thereby 
obtained does appear to be significant. The third method is therefore 
preferred and a further reason for preferring it will be demonstrated in 
Chapter 4. 

Minimizing either F; or Fç gives an estimate ¢ of ¢ by using the preferred 
method. The minimum value of Fç is less than the corresponding minimum 
value of F; for 37 of the 40 series. Thus the r, „ would appear to have smaller 
standard errors than the r, , even though the г, m are generally larger than 
the r,s. It is therefore best to use the r, m to estimate $ by minimizing Fç. 

Table 3.3 summarizes the estimates B and ф obtained by minimizing 
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Table3.3 Estimates for a product process 


Byminimizing F, Minimum values of 
B Bm ф қ F, 

Spotseries 

Allied 0.113 0.275 0.981 59.5 266.4 
Alcoa 0.117 0.281 0.980 85.3 294.4 
Am.Can 0.076 0.219 0.977 51.2 227.9 
ATandT 0.109 0.268 0.914 86.5 111.3 
Am.Brands 0.120 0.285 0.927 84.8 114.7 
Anaconda 0.091 0.243 0.958 47.6 167.8 
Bethlehem 0.073 0.215 0.961 57.3 128.1 
Chrysler 0.086 0.234 0.984 57.0 302.0 
Dupont 0.158 0.337 0.984 56.5 79.6 
Kodak 0.147 0.322 0.983 78.3 128.7 
G.Electric 0.183 0.370 0.989 35.9 59.6 
G.Food 0.111 0.272 0.987 77.2 283.0 
G.Motors 0.118 0.281 0.987 56.9 294.3 
G.Telephone 0.098 0.253 0.991 43.9 185.2 
Harvester 0.101 0.257 0.979 68.8 247.8 
FT30 0.125 0.292 0.976 45.1 231.4 
Gold 0.280 0.501 0.983 130.3 153.6 
Silver 0.322 0.562 0.985 62.5 66.0 
Copper 0.212 0.408 0.985 91.2 399.9 
Lead 0.264 0.478 0.979 81.2 167.7 
Tin 0.305 0.537 0.981 126.1 176.8 
Zinc 0.357 0.617 0.989 61.9 125.8 
£/$ 0.197 0.388 0.987 73.2 323.8 
Futures series 

Corn (12) 0.383 0.661 0.994 38.6 166.6 
Corn (6) 0.413 0.716 0.993 46.9 349.4 
Corn (3) 0.463 0.832 0.995 557.7 2449.9 
Cocoa (12) 0.244 0.451 0.984 97.5 111.1 
Cocoa (6) 0.204 0.397 0.987 117.4 131.9 
Cocoa (3) 0.164 0.345 0.989 265.2 355.5 
Coffee (12) 0.278 0.498 0.988 121.2 536.9 
Coffee (6) 0.298 0.527 0.988 141.7 402.5 
Coffee (3) 0.286 0.510 0.994 325.1 697.4 
Sugar (12) 0.318 0.556 0.984 65.6 71.5 
Sugar (6) 0.290 0.515 0.986 87.7 120.4 
Sugar (3) 0.296 0.523 0.979 102.7 130.6 
Wool (12) 0.442 0.775 0.990 64.2 87.5 
£/$ (6) 0.166 0.347 0.987 76.1 311.4 
DM/$ (6) 0.198 0.390 0.985 74.3 105.3 
SF/$ (6) 0.210 0.406 0.989 40.4 43.4 


T-bond 0.066 0.203 0.995 58.7 70.5 
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F,(B’, ф). The table presents В, the implied estimate В, then ф, F,(B, ф) 
and, for comparison, the minimum value of Р,(ф). It can be seen that the 
least estimate ¢ for the 23 spot series is 0.914, whilst for the 17 futures series 
the least ф is 0.979. The median spot estimate is 0.983, the futures median 
is 0.988. Thus it is clear that either $ is nearly 1 when the model is 
appropriate or the model is inadequate with the process (log (V,)} prob- 
ably non-stationary. We will conclude our discussion of the stationarity or 
otherwise of the conditional standard deviations in Chapter 4. The esti- 
mates for futures series taking three months' data from each contract are 
inaccurate because there are few terms in the numerator of r, „ when 7 is 
near the maximum lag used, namely 50 trading days. 

A comparison of the two estimates of B, i.e. Ê given by (3.9.1) and Êm 
given by (3.9.5), shows that Й,, is usually less than ñ and the differences are 
not negligible. It is possible that at least one of the estimates is biased or the 
assumed model is inaccurate, for example the distribution of the U, could 
be slightly non-normal. 

Figure 3.1 shows the actual autocorrelations г; „ апа the fitted approxi- 
mation to the theoretical autocorrelations, i.e. Bd’, for the longest of the 
40 series. These sugar returns have Б.В, $) = 65.6 and the fitted curve 
appears to fit well. A similar graph was presented in Taylor (1982a) based 
upon the г, , for a slightly shorter sugar series. The article also considered 
a more detailed model in which a = E[log (V,)] can vary from contract to 
contract. 


3.10 PARAMETER ESTIMATES FOR ARMACH PROCESSES 


Initially consider an uncorrelated, finite-order, ARCH process, as defined 
in Section 3.6, so V, is a function of X; — ш (1« i « p) and p + 1 
parameters a, (0 < / < p). Denote by о the set of parameters и, ao, 04, ..., 
ay. The likelihood function for n observed returns is 


L(x, Х,..., ха |е) = f(x,|@)f(x2|h, e) Rm f(x, [1-3 e) 


with f(x, |1,1, œ) denoting the conditional density of X, given the previous 
observations /, 4 = (xy, x), ..., х1} and the parameter vector w. These 
conditional densities can be described for t > p by finding v, and then 
using the appropriate normal density: 


бх la, e) = fxiv) = (VQm)w) ' exp [-36« — ш/к). 
The v, depend on о. The maximum likelihood estimate Ф, for observations 


p + 1to n, maximizes 


n 


p(w) c I! fox Es w) 


t=p+1 
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Figure 3.1 Actual (solid curve) and fitted (dashed curve) autocorrelations for the 
absolute returns when a product process is estimated from the longest sugar 
futures series 


or, equivalently, maximizes 


n 


log L,(w) = —3(n — р) log (27) - У log (v) 


t=p+1 
-% У (xt — иш) Ум. (3.10.1) 
t=p+1 
If the process is autocorrelated, equation (3.10.1) provides an acceptable 
approximation to the likelihood function, so it will be assumed the process 
is uncorrelated when estimating o. 

At the end of Section 3.6 it was proposed that a suitable ARCH process 
could be defined by a,,, = 6(a;) foralli > 0. There are then few parameters 
in the resulting, infinite-order process, called an ARMACH process. Two 
possible specifications of an ARMACH process are summarized by equa- 
tions (3.6.7) and (3.6.9). Both are considered. For either specification we 
need to estimate и, a scale parameter (F[S,] or E[M,]), Ө, and a parameter 
ф; recall ф now denotes either p,.1,5/p,.5 Or p, + fp, м for all 7 > 0. 
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The following estimation procedure for spot series is recommended as 
being practical and based on the likelihood function. First, estimate и by x 
and the scale parameter by s or m as appropriate. Second, define an 
estimate ў, of v,, byeitherv, = VSorv, = т/б, depending on the specifica- 
tion chosen. Then, fora particular $ and 0, usea recursive equation, (3.6.7) 
or (3.6.9), to obtain ú, 2 < t « n). This is done by replacing V, by ú, 
X4 = p by x4 — X, ox by sand им by min the recursive equations. The 
error v, — 9, willbenegligibleforlarge enough t, i.e. the initial error v, — Y 
will no longer be important. Thus the function given by (3.10.1) can be 
maximized for some integer p, with v, replacing v,. This optimization 
provides estimates ф and б, expected to be close to exact maximum 
likelihood estimates. The choice p — 20 has been made for all the calcula- 
tions. 

Futures series require a more complicated procedure. Suppose the 
series uses returns from k contracts, ending on day T; for contract í and so 
beginning on day T; + 1 for contract i + 1. For the first contract, v4, can be 
obtained by applying the method for spot series. For subsequent contracts, 
we need a method for defining ў, when t = T; + 1 and 2 < i < k. Two 
methods have been tried; each uses 20 returns on contract í from times 
Tı — 19 to T; , inclusive. Method A sets V, at time t= T; , — 19 for 
contract / equal to the value of V, at the identical time for contract i — 1. 
This method then uses (3.6.7) or (3.6.9) as before, with s and m being the 
usual averages over all times from 1 to n inclusive. Method B treats each 
contract separately. Contract / provides averages s; and m; by averaging 
over times 7; , — 19 to T; inclusive and these averages respectively replace 
оў in (3.6.7) and wy in (3.6.9). At time t = T; 4, — 19 the initial value for 
contract i is either V, = Vs; ог m,/5, depending on the specification of the 
ARMACH process. Method А assumes that at any moment in time all 
contracts have similar conditional standard deviations, whilst method B 
avoids this assumption. For either method the first return from contract i 
appearing in (3.10.1) has t = 21 when í = 1 and t = T; + 1 when / > 2 
with the last return from contract í appearing in (3.10.1) having t = T;. 


Results 


The ARMACH specification based on (3.6.6) and (3.6.7) has р, ; = Сф” 
whilst the alternative specification, (3.6.8) and (3.6.9), has p,m = Сф”апа, 
in each case, Cis a function of ó and 0: 


C = (1 — 0ф)(ф — 0)/{Ф(1 — 200 + 0?)). (3.10.2) 
The log-likelihood, (3.10.1), can be maximized either over 1 > C > 0, 
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1> ф > 0 or over 1 > ф > 8 > 0 and the former choice is made here. 
Given C and ó it is possible to obtain 0 by rewriting (3.10.2) as 


6- pe+1=0, D= {1 + 4671 – 2O}/{6 – C), (3.10.3) 


and then the relevant solution is 0 = (D — V(D? — 4)}/2, taking the posi- 
tive square root of D? — 4. 

A search routine was used to seek the numbers С and ф maximizing 
(3.10.1). Estimates accurate to at least two decimal places were sought. 
When necessary a constraint ф x 0.995 was applied. Considerable com- 
puter time was needed for the optimizations. 

Table 3.4 presents estimates C, ф, and ó for the ARMACH specification 
based on past squared returns. This specification generally has a far higher 
maximum likelihood than the alternative specification based on absolute 
returns. The final column of Table 3.4 lists the maximum log-likelihood for 
the squares specification minus the maximum for the alternative specifica- 
tion. All the futures estimates are for method A, shown in Chapter 4 to be 
preferable to method B. 

Several estimates ф equal the upper limit of 0.995 and many С are 
suspiciously high, as Có "far exceeds the sample autocorrelations r, ,. Such 
estimates cast considerable doubt upon the general descriptive value of 
models based on the ARCH concept. 


3.11 SUMMARY 


The non-trivial autocorrelation found in series of absolute returns 
and series of squared returns can be explained by changes in either 
the variances or conditional variances of the returns. Changes in 
the level of market activity are probably the cause of the variance 
changes. 

Two contrasting models for returns incorporating changes in con- 
ditional variances have been discussed, firstly product processes and 
secondly ARCH processes. There is sufficient autocorrelation among 
the absolute and squared returns to estimate the parameters of special 
versions of these stationary models, i.e. the lognormal, AR(1), product 
process and the ARMACH processes. Investigations of stationarity 
and the selection of a best stationary model (when appropriate) require 
more empirical results and these follow in the next chapter. The param- 
eter estimates show that any successful stationary model must have 
very high autocorrelation in the series of conditional standard 
deviations. 
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Table 3.4 Estimates for ARMACH models 


Basedon process S 
Difference in 
c ф 6 log-likelihood 

Spotseries 

Allied 0.16 0.970 0.894 16.7 
Alcoa 0.09 0.995 0.967 30.2 
Am. Can 0.08 0.980 0.939 21.2 
ATandT 0.20 0.725 0.599 13.8 
Am. Brands 0.27 0.790 0.621 21.5 
Anaconda 0.10 0.875 0.804 47.5 
Bethlehem 0.10 0.940 0.876 27.6 
Chrysler 0.08 0.985 0.948 20.1 
Dupont 0.21 0.955 0.850 9.1 
Kodak 0.16 0.970 0.894 2.0 
G. Electric 0.11 0.990 0.950 742. 
С.Ғооа 0.09 0.995 0.969 5.9 
G.Motors 0.11 0.975 0.920 7.3 
G. Telephone 0.07 0.990 0.961 6.5 
Harvester 0.12 0.985 0.936 10.7 
FT 30 0.14 0.975 0.910 1.0 
Gold 0.33 0.945 0.785 15.8 
Silver 0.32 0.985 0.887 9.6 
Copper 0.42 0.985 0.862 —0.6 
Lead 0.52 0.990 0.863 19.1 
Tin 0.73 0.995 0.848 11.6 
Zinc 0.62 0.990 0.834 8.5 
£/$ 0.34 0.970 0.836 53.2 
Futures series (Method A) 

Corn (12) 0.57 0.995 0.891 10.9 
Corn (6) 0.51 0.995 0.903 16.5 
Corn (3) 0.54 0.995 0.897 21.4 
Cocoa (12) 0.32 0.985 0.887 13.0 
Cocoa (6) 0.28 0.985 0.896 7.7 
Cocoa (3) 0.27 0.985 0.899 11.9 
Coffee (12) 0.76 0.995 0.837 4.6 
Coffee (6) 0.80 0.995 0.819 2.2 
Coffee (3) 0.78 0.995 0.828 —7.1 
Sugar (12) 0.48 0.990 0.872 22.0 
Sugar (6) 0.49 0.985 0.843 7.7 
Sugar (3) 0.48 0.985 0.846 10.7 
Wool (12) 0.72 0.995 0.852 12.8 
£/$ (6) 0.23 0.970 0.871 7.8 
DM/$ (6) 0.36 0.985 0.877 2.1 
SF/$ (6) 0.38 0.990 0.895 0.7 


T-bond 0.11 0.990 0.950 0.3 
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APPENDIX 3(A) RESULTS FOR ARCH PROCESSES 

Without any loss of generality, assume џи = 0. In this appendix (U,) is 
standardized, Gaussian, white noise. Firstly, suppose we are interested in 
the kurtosis and autocorrelation properties of the ARCH specification: 


і-1 


p 1⁄2 
X = É + > лї U.. 


Writing Y, = X,/Voo, Y, is (1 + Хо У? 72), Now X, and Y, have the same 
kurtosis, and the processes (X?) and (Y?) have identical autocorrelations, 
always assuming these numbers exist. Therefore we will assume œ = 1. Let 
S, = X, so 


p 
S, = í +5 «5. ut. (3.A.1) 
1=1 


Assuming E[X?] is finite, S, will have finite variance and mean ps = 
1 — (o, + +++ + а, )). Equation (3.4.1) can be rearranged to give 


p 
5, ~ Ms = uis a; (Sy; > us) + us (U2 yes, 1). (3.A.2) 


i=1 


Now U is independent of S,_, — и; for all j > 0. Thus, when both sides of 
equation (3.A.2) are multiplied by S, , — us, т > 0, and expectations are 
taken, 


p 
EIS, — usY(S-, — м$)] = > %Ё[(5—; - ausSi- — us). (3.АЗ) 


i=1 


Dividing all the expectations by var (Sj): 


p 
Ds = M epis for all 7 > 0. (3.4.4) 
ігі 
Equations (3.А.4) are the well known Yule-Walker equations and uniquely 
determine the p, s. For the associated zero-mean AR(p) process 


p 
A = X аА; + U, (3.A.5) 


і-1 


р 
E[A,A,_,] = > а ЕТА, A] 


i=1 
and so 
p 
Pra = > O;D.—iA, 


i21 


hence p, = p, д for all т. 
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To find the kurtosis of the X,, it is helpful to consider o2 = var (А,). 
Squaring both sides of (3.4.5) and equating the expected values of the two 
sides: 


р-1 


р p 
02 = 1 + a(S at) + 202 > > аад. (3.A.6) 
ігі ігі y=i41 

Let y = Е[52], so ky = Е[ХФ]/Е[Х?]? = y/u? апа var (5) = y — м2. Then 
squaring each side of (3.A.1) and taking expected values: 


p p 
y= af + (3 at) t2u, > a, + 2 У У a EISS}; 
іг ім j 


i<j 
(3.A.7) 


Now E[S,_;5;-;] = cov (5,_;, 5,2) * ш = (у – ТУН + ui. As Pas = 
p;-i,a, equation (3.4.6) can be used to simplify equation (3.A.7). When this 
is done, the kurtosis is found to be ky = 3/(3 — 204). Clearly of < 1.5 isa 
necessary condition for finite kurtosis. This constraint is a necessary and 
sufficient condition when р = Тапа thus с2 = 1/(1 — o2), see Engle (1982, 
p. 992). It appears probable that, for any p, o4 < 1.5 is a sufficient condition 
for finite kurtosis but this assertion has yet to be proved. 

Secondly, consider the autocorrelation properties of X= 
{ay + X aj|X,., |) U4. Again we can assume ay is 1. Let M, = |X,], so 


M, = ( + > aM, Jul. (3.А.8) 
i=1 


Then M, has mean py = ó/(1 — ё{о +--+ + ap}) and (3.А.8) can be 
rewritten as 


i21 


p 
M, — им = us e ( M,-; — Ho} + py [6 |U, — 1) 
and hence for any 7 > 0 


p 
EM, — имХМ,.-,- Ha) = ó У, ЕМ, — имХМ-, — им). 


f=1 


These equations imply {M,} has the same Yule-Walker equations and 
hence autocorrelations as the associated process A, = 6X a;A,_, + И. 


Chapter 4__ 


Forecasting Standard Deviations 


Daily returns are useful information for forecasting the standard deviations 
of future returns. This chapter compares several forecasts and recom- 
mends a convenient forecasting method. The forecasts have many uses, 
permitting better random walk tests and more accurate option pricing 
methods. 


4.1 INTRODUCTION 


Given complete information about present and past returns, denoted 
I, = (x,-;, j 2 0), we seek the best predictor of the next day's conditional 
standard deviation, denoted v,,,. The level of market activity on day t + 1 
will depend partially on information available before the day begins and, to 
some degree, /, will summarize this information. Forecasts of v,.. will be 
denoted by %,;, and, when helpful, regarded as the realized values of 
random variables V... 

The best predictor of v,,, is sought for four reasons. Firstly, to compare 
forecasts based upon different variance models and so to acquire evidence 
about the issue of stationarity and the best model when a stationary one is 
acceptable. Secondly, to produce a numerical estimate of the volatility of 
future price-changes by using their estimated standard deviation as a 
simple risk measure. Thirdly, to provide a day-to-day statistic for rescaling 
returns to give acceptable data for valid random walk tests (Chapter 5). 
And, finally, to provide the necessary estimates of future standard devia- 
tions for pricing options (Chapter 9). 

Option pricing models have several parameters and the most difficult to 
estimate is the standard deviation. The methods presented in this chapter 
are very practical. 

It is only possible to calculate the actual conditional standard deviations 
from observed returns if special and, almost certainly, incorrect assump- 
tions are made, for example that an ARCH process generates returns. Thus 
we cannot calculate forecast errors, v,,4 — #,; 1, nor an overall criterion for 
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summarizing forecast accuracy such as E (и — %,1)7. Instead, it is neces- 
sary to deduce the relative accuracy of forecasts V,,, by considering related 
forecasts of an observable series. 

Recall that the return x+, minus its expected value equals v,,4u,,4 with 
и, coming from a distribution having zero mean and unit variance. Also 
recall one of the definitions of the absolute return, adjusted for a known 
mean и: 


Maa = xa — Bl = малша. 


A forecast of т, tells us something about likely values for v,,,. As the 
forecast rfi,, increases we ought to increase V;,,,, since |u| is best 
forecast as a fixed constant because (U,) is exactly or almost strict white 
noise. Indeed, sensible forecasts /f,4 апа V,,, must be related to each 
other, established formally in Section 4.2. 

Forecasts m,,, based upon the product and ARMACH models are com- 
pared with forecasts suitable for non-stationary models in Sections 4.3 and 
4.4. These empirical results support the recommendation, made in Section 
4.5, that the most appropriate forecast V,,, is an exponentially-weighted 
moving average of present and past absolute returns. 

It is well known that it is difficult to forecast returns by a better forecast 
than the past average. Forecasts of m,,, can be significantly better than a 
simple average forecast, a consequence of the non-linear structure of the 
returns generating process. Forecasts of s,,, = m?,, are less successful 
and are briefly described in Section 4.4. 

When relevant options are traded it is possible to deduce an implied 
standard deviation from option prices. The implied value can be used to 
forecast the conditional standard deviation. These forecasts have yet to be 
compared with the forecasts presented in this chapter. 


4.2 KEY THEORETICAL RESULTS 


Uncorrelated returns 


Initially suppose the standardized random variables U, = (X, — ш)/М, are 
strict white noise and that U, is independent of V, and all X,_,, j > 0. These 
conditions are true for any realistic uncorrelated process (X, , including all 
the uncorrelated processes defined in Chapter 3 whether stationary or 
non-stationary. As 


Мы = М | Url 
it then follows that 


ЕМ, |l] = EUG HAEE UL A (4.2.1) 
= SE[V,44| 4] (4.2.2) 
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because the expected value of | U,,,| is its unconditional mean 6 whatever 
the available information /,. The U, will be assumed normal so ë = 0.798. 

A classical result in statistical forecasting states that optimal forecasts are 
expected values conditional upon the available information. Thus (4.2.2) 
suggests the optimal M,,, and V,,, are related linearly. To confirm this 
consider mean square errors, denoted MSE and defined by 


МЕ(М, A) = (М, — Ma] (4.2.3) 
MSE(V, 4) = EKV — Ya (4.2.4) 


Also let um гат = ЕМ, 1] and ду, = E[V,,.]. These means are not neces- 
sarily stationary. They provide simple forecasts having 


MSE(j 44) = var (Mis),  MSE(uy a) = var (Vu). 


Now let M,,, and V,,, be any pair of forecasts satisfying the linear 
relationship 


Mri. = 8%. (4.2.5) 


From (4.2.3), then (4.2.5), and the assumption that U,., is independent of 
Мапа 4: 


var (Mia) = MSE(A4...) = (E[A42..] — pea) 

m {EIM +] = 2E[M, „А, у] + E[M?..1]} 
25ELVi41| Ur Vil ой SEV к) Шй а 
SEV 1V1] == ЕГУ? +11 = Biter} 

82 [var (V,.,) — MSE(V,,1)]. 


il 


Defining the improvements in MSE, upon the forecasts provided by the 
unconditional means, by 


IMSE(M,,,) = MSE(u 4) = М5Е(М,,|) 
and 
IMSE(V,.,) = MSE(uy 11) — МЅЕ(, , 1) 
it follows that 
IMSE(M,,1) = 8"IMSE(V,,) (4.2.6) 


whenever (4.2.5) applies. 

An optimal forecast minimizes MSE and maximizes IMSE. Consequently, 
(4.2.6) proves that optimal forecasts of m,., and v,,, using the same set of 
information must be related to each other by (4.2.5). To find the best 
forecast of v,,, it suffices to obtain the optimal forecast Л» and then 
define ў; = rfi,4/6. Note that it is not necessary to assume the process 
{X} is stationary. 
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Correlated returns 


The proof of (4.2.6) fails when the X, are autocorrelated because then | U,,.,| 
and V,,, need not be independent, as assumed in the proof. Nevertheless, 
a near to optimal forecast of v,,, can be obtained by dividing the best rfi, 
by 8. For example, if (X,) is a product process, so that (V, and (U,) are 
stochastically independent, (4.2.1) remains true and so the optimal V,,, is 


MIEL Usal lu 


with M,,, = ЕМ, 111) being the optimal forecast of M,.,. As {U,} is almost 
strict white noise, there will be very little autocorrelation among the |U,| 
and so the conditional expected value of |U,,,| will be almost exactly 8 
whatever the information /,. 


Relative mean square errors 


It is convenient to discuss mean square errors as a fraction of the overall 
variance, so relative mean square errors are defined by 


_ MSE(M,,,) _ MSE(M,,,) 


RMSE(M,, 4) (4.2.7) 
макмал) — MSElum, 1) 
and 
RMSE(V,,,) = МӘН Va a МЕЈ (4.2.8) 


var(V,, i) Е М5Е(д ia) 


Optimal forecasts give the least possible RMSE with 0 < RMSE < 1. When 
the returns process is uncorrelated and M,,, = 5V,,, implies (4.2.6), the 
RMSE values satisfy 


1 — RMSE(V,,) = А11 — RMSE(M,,1)] (4.2.9) 


with A = var (М,,1)/162 var (V,,4)) a constant not depending on fore- 
casts. 


Stationary processes 


When, furthermore, (X,) is stationary the constant A,,, is the same number 
À for all t. Equation (4.2.9) then shows that the relative accuracy of forecasts 
?, у can be estimated from the relative, empirical accuracy of forecasts т, 
and an estimate of À. This is done in Section 4.4. 

An ARCH process has v,,, equal to a deterministic function of the 
information in /, and so it is possible to achieve RMSE(V,,,) = 0. 

A product process has À = р, у/р, м, this being equation (3.5.5). It is not 
possible to forecast v,,, perfectly and thus the least value of RMSE(V,,) is 
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positive. Later it will be noted that a typical least value is 0.3 for a plausible 
product process. At present, just note that estimates of RMSE for forecasts 
V,,, can be used to eliminate either ARCH or product processes from the 
category of feasible models for returns. 


4.3 FORECASTS: METHODOLOGY AND METHODS 


Several forecasts of т, will be compared for each of the 40 series with the 
objective of finding a best forecasting method. The best rñ,,, then provide 
recommended forecasts 9,,4 = rf4,4/6 of future standard deviations. All 
the calculations will use the definition m, = |x, — x|, x being the sample 
mean return calculated over an entire time series. 

Accurate forecasting methods are parametric and it is necessary to 
estimate the parameters found in equations for гӯ, ‚1. Misleading results 
could occur if parameters are estimated from complete time series and 
then forecasts are evaluated for the same data. It is essential to split each 
time series into two sections, estimate parameters from the first section 
and then evaluate forecasts for the remaining data. Parameters will be 
estimated from observations 1 to n, inclusive and forecasts compared for 
Observations n, + 1to n. The selection of n, mustbe subjective. Often two 
parameters need to be estimated but only one number, namely MSE, is 
estimated from the forecasts. This is the reason for choosing n, — 2n/3 for 
spot series. For k futures contracts, n, is chosen to put the first 2k/3 
contracts into the parameter optimization set; if 2k/3 is a fraction then the 
highest integer less than 2k/3 is used. 

Seven forecasts are compared for spot series. These are now described 
and followed by a discussion of the comparable forecasts for futures series. 


Benchmark forecast 


The mean of a process is a simple forecast against which others can be 
compared. Given /,, the natural estimate of the mean is 


t 
=m (4.3.1) 


This would be the optimal forecast if the returns process was strict white 
noise. 


Parametric forecasts 


On various occasions in Chapter 3 we have noted that the autocorrelations 
of {M,} can be modelled, exactly or approximately, by p, м = Кф” with 
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0 < К< 1,0 < ф < 1. These are the same autocorrelations as those of the 
АКМА(1,1) process: 


MT — им = ФМТ = uw) = € — 083 


with {¢,} uncorrelated, providing 6 is chosen to give K = (1 — ф0)(ф — 8)/ 
{Ф(1 — 20%  8?)); see equation (1.9.13) and, in it, replace a by ó and b by 
—80. To find 0 from Kand ó we must solve 


8- DA+1=0, D= {14 62 -2K40 — K)}. (4.3.2) 


Forecasts can be found if |6| < 1 апа consequently the relevant solution is 
8 = (D — ү(0? — 4)}/2, as D > 2 implies the other solution has Ө > 1. 

The optimal forecast M,,; from among the class of linear unbiased 
forecasts 


x 


Мм + > а(М,., = Hm) 
1=0 
is determined by the autocorrelations p, м. Thus {M,} and (Mf) have the 
same best sequence a;. From (1.9.15) it follows that the optimal linear 
forecast of М, . 1 is 


Miza = um + > ($ — 8)0 XM, — им) 


1=0 


= um + (b — 0)(M, — им) + Ө(М, — им). (4.3.3) 


To produce a forecast y+, it is necessary to replace py, Ф, and 0 by 
estimates. At time t > n,, иь has been estimated by rñi!), while ó, 0 have 
been estimated from observations 1 to n,. A forecast rf, , derived from 
(4.3.3) depends on the latest observation m,, the previous forecast rf, made 
at time t — 1, and the estimates, thus: 


т, = (ф — ó)m, + OM, + (1 — à). (4.3.4) 


Product process forecasts 


The product process having lognormal, AR(1) conditional standard devia- 
tions has, approximately, p,m = В(3)ф ' with B? = var (log (V,)) and B(B) 
defined by (3.5.8). Using method 1 of Section 3.9 gives @ from m and s, ó 
from the г, m and hence a 6 by setting K = B(B) and solving (4.3.2). These 
estimates and all the subsequent ones are computed from the first m 
observations only. The é апа 6 given by method 1 define the second 
forecast r^. 

It was claimed in Section 3.9 that method 3, which estimates both B and 
ф directly from the r, m, is better than method 1. Estimates Вапа ф give 6 by 
substituting B for K and ó for фіп (4.3.2). These é and 6 define the third 
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forecast Me. Comparing the accuracies of rh), and rh, provides further 
evidence about the best way to estimate ¢ for the product process. 


ARMACH forecasts 


Two special ARCH processes were introduced in Section 3.6. Equation 
(3.6.9) has V, a linear function of V,_, and М, , with p,m = Сф”. The method 
used in Section 3.10 provides estimates C, ó, and hence ó and a fourth 
forecast rfi. 

The other special process is (3.6.7) with V2 linear in V2 , and М2. Then 
5, = M? has p, = Сф". The parameters С and ф сап be estimated by the 
same method of maximum likelihood. This leads to forecasts $,,; having 
(cf. 4.3.4) 


^ 


Siar = (é — бе, + 64, + (1 — ф62, (4.3.5) 


with $7; = (s, + -+ + s,)/t. If this ARCH process is valid then v,,; must be 


МЗ, and thus the fifth forecast of т,,1 is rh, = 8 V6. 


EWMA forecasts 


The five forecasts described so far all assume a stationary process for 
returns. This causes the assumed stationary mean u to appear in (4.3.3). 
Indeed, (4.3.3) can be generalized to show that the best rfi, м, as N — ©, 15 
им, if |, is the available information. 

It is perfectly reasonable to doubt the assumption of stationarity es- 
pecially for long series. Estimates of the future mean E[M,,,] calculated 
from m,, s < t, will then be unreliable. A simple and practical way to avoid 
using им is to set $ó = 1 in (4.3.3), giving 


=> (1 — 6)Ө'М,_, = (1 — 6)М, + 0M,. (4.3.6) 


This defines an exponentially-weighted moving average (EWMA). The 
effect of a change in the mean, sometime in the past, is reduced by 
assigning most weight to recent observations. An EWMA should forecast 
well if there are occasional changes in variance such as occur in Hsu's 
non-stationary models. 

From (4.3.6), actual forecasts can be calculated using 


т, = (1 — 8) m + Érfy. (4.3.7) 
Letting y = 1 — бапа assuming м = 0, 
Шат |x| + (1 — ym, 


an equation used by the author in earlier research to calculate an estimate 
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M4, ОЁ the conditional mean absolute deviation £[M,,4]/,]. Taylor and 
Kingsman (1979) presented a method for estimating y based ona likelihood 
function. Estimates ў = 0.1 were obtained for copper and sugar returns. 
However, the estimation method is too complicated and time consuming 
to be recommended for further use. 

A straightforward way to choose 6 or y is to use (4.3.7) to calculate the 
т» as a function of 0 and then minimize 


E [т — rf, (OVP. 


Using rf, = (M, + :-: + m3/20, the squared forecast errors have been 
summed over tfrom 21 to n, — 1inclusive. The best ө, to an accuracy of two 
decimal places, then provides a Ófor use in (4.3.7) which is just (4.3.4) with 
ф = 1. This method defines a sixth set of forecasts rff, . 

The author has used y = 0.1inanumberof papers, i.e. ó = 0.9, to derive 
the rfi, 4 for a rescaling transformation used to improve random walk tests 
(e.g. Taylor, 1980, 1982b). To reassess the choice y — 0.1aseventh forecast 
rh), is defined Бу ó = 0.9 in (4.3.7). 

Forecasts 2, 3, 4, 6, and 7 all use the same updating equation (4.3.4), 
whilst forecast 5 is based on the similar equation (4.3.5). The initial 
forecasts ri are not important and have been set equal to the average m, 
over times 1 to n,. The later forecasts differ because of the variety of 
methods used to calculate % and ó. For example, the first US stock series 
has these estimates: 


Forecast $ ó 
2 0.840 0.685 
3 0.968 0.914 
4 0.905 0.819 
5 0.915 0.796 
6 1 0.960 
7 1 0.900 


The estimates ф and б are often similar for these six forecasts. Over many 
series the differences between estimates can provide a meaningful com- 
parison of the forecasting methods. 


Futures forecasts 


Futures series can be forecast either as a single series needing minor 
adjustments when the delivery date of the contract changes (method A) 
or as a collection of separate contracts, each having its own statistical 
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parameters (method B). The latter situation allows a different mean E[M, +1] 
for each contract. 

As in Section 3.10, suppose {x} is a series of returns, one per trading day, 
with returns on contract í from time t = T; + 1to t = T; inclusive. Also 
suppose there are 20 further returns available on contract ; from 
t= T; — 19 to t = T, inclusive (i > 1). Then an eighth forecast is the 
estimate of E[M,.,] based on contract í alone: 

t 
Pi) = У mi/(t+ 20- T, 4). (4.3.8) 


s=T 1-19 


Two major differences between methods А and B are their estimates of u 
and their initial values of the forecasts for the separate contracts. For úm, 


method A follows the procedure for spot series and uses rh), whilst 


method B uses rfi, , with a similar replacement of 57, in (4.3.5). 

Initial values of rh) at time t = Т), for contract i and 2 € j € 7 are 
calculated as follows. Method А sets т! at time t = T; , — 19 for contract 
i equal to the value of rfi at the identical time for contract í — 1. Twenty 
m, for contract í are then substituted into (4.3.4) or (4.3.5) to give the first 
forecast M1}, t = T, |, for the forecast evaluations. Method B simply 
averages the twenty m, for contract i from t = T; , — 19 to T, , inclusive to 
give the same forecast rfi//) for all j # 5, with a similar average of the s, for 
the initial fifth forecast. 

The estimates $, ó depend on the method, A or B, for the fourth and fifth 
forecasts, as described in Section 3.10. The estimate ó depends on the 
method for the sixth forecast because of the different initial values used 
when 8 is optimized. For the second and third forecasts ф, ó do not depend 
on the method; they are always estimated as in Section 3.9. 


Empirical RMSE 
Forecasts are compared with actual observations from time n, + 1 until 
time n. Their empirical mean square errors are 


n-1 
MSE(fi, j) = У (ma = ФИ (п- m - 1). (4.3.9) 
t=n,+1 


The natural estimate of MSE(um t1) іп (4.2.7) is MSE(rh, 1) as the first 
forecast is the estimated value of им 1 for a stationary process. Therefore 
empirical relative mean square errors are defined by 


RMSE(m, j) = MSE(m, J/MSE(rh, 1). (4.3.10) 


A value of RMSE(m, j) less than 1 will indicate that forecast j should be 
preferred to the benchmark forecast. 
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4.4 FORECASTING RESULTS 


Absolute returns 


The empirical values of the relative mean square errors have been averaged 
across the 23 spot series and across all the 17 futures series except the short 
Treasury Bond series. Average values of RMSE(m, j) are as follows. 


Futures Futures 
Forecast | Model/Method Spot method A method B 
2 Product process 0.915 0.774 0.775 
3 Product process 0.906 0.766 0.765 
4 ARMACH 0.914 0.771 0.803 
5 ARMACH 0.906 0.773 0.792 
6 EWMA 0.901 0.771 0.771 
7 EWMA 0.902 0.767 0.767 
8 — — — 0.819 


All these averages are well below 1 showing that forecasts more accurate 
than the current mean can be obtained. 

Futures series generally have far lower RMSE than spot series. Compar- 
ing the averages for methods A and B only reveals an important difference 
for the ARMACH models with method A preferred. Consequently detailed 
results are only discussed for method A. The eighth forecast is inferior to 
the parametric forecasts. This must imply that conditional standard devia- 
tions change within the lifetime of a futures contract. 

Overall averages for 39 series, using method A for the futures series, are 
as follows. 


Forecast Model/Method Average RMSE 
2 Product process 0.857 
3 Product process 0.848 
4 ARMACH 0.855 
5 ARMACH 0.851 
6 EWMA 0.847 
7 EWMA 0.847 


The average RMSE are all similar with the EWMA forecasts marginally better 
than the others. These similar averages are partially caused by the similar 
estimates $ and б used to define the different forecasts. To decide which 
model is appropriate for each series we need further results. These are 
estimates of RMSE for forecasts of conditional standard deviations and the 
empirical RMSE for forecasts of absolute returns further into the future 
than the next day. 
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Conditional standard deviations 


The empirical RMSE of forecasts ў, = rñ,,,/ó сап be estimated under the 
assumption that the correct model is stationary. From (4.2.9) it can be seen 
that an appropriate estimate for forecasts ë ,, based on т, will be 


RMSE(?, j) = 1 — АП — RMSE(A, j)] (4.4.1) 
with A estimating A = var (M,,4)/18? var (V,.,)}. As 
À = (E[M?] — Е[М,]2)/(52Е1М2]— ELM) 


a convenient estimate of À is simply 

À = (s — т?)/(8°5 — m?) (4.4.2) 
with m and s the average values of m, and s, = m? which have been 
calculated for complete series. 

The 15 estimates of À for the US stock price series range from 3.1 to 5.4 
with 12 of them between 4.0 and 5.0. The range for the 6 metals series ts 2.4 
to 3.1, for the 16 agricultural futures it is 2.1 to 4.2, and for the 4 currency 
series, 2.9 to 5.0. The UK share index series gives À — 27.7 and negative 
values of RMSE(?, j) indicating an unsatisfactory À. 

Another way to estimate À assumes the lognormal AR(1) product process 
is adequate. Then А = 1/B(f) and the reciprocal of the estimate B found 
directly from the autocorrelations of the m, can be used for À. This method 
usually gives a higher À than (4.4.2). 

Any ARCH process has RMSE(V,,;) = 0 so the estimates RMSE(ç, p 
should then be near zero. The product process considered in this chapter 
has RMSE(V,,,) = (1 — ф?)/(1 — Өф); this is correct when the approxima- 
tion р, м = Bd’is used (the proofis omitted). Using the preferred estimates 
$ and 6 implied by Table 3.3 and averaging (1 — ф2)/(1 – 69) over the 39 
long series suggests an expected average for RMSE(V, j) of 0.33 for the 
advocated product process. However, the average values of the estimates 
RMSE(v, j) using (4.4.1) and (4.4.2) are approximately one-half for each 
forecasting method. This appears to rule out ARMACH models for returns 
and suggests that in a product process { V,} could need more parameters 
than а lognormal, AR(1) model. 

Looking at the estimates of RMSE(V, j) series by series shows they are 
highly variable and probably far from accurate. Forecast 4, for an ARMACH 
specification, has an average estimate of 0.49 with 33 estimates greater than 
0.25 and 23 estimates exceeding 0.5. Similar figures occur for the alternative 
ARMACH specification. Thus these models are inadequate and more 
generally it is concluded that any ARCH model is most unlikely to describe 
returns adequately. Of the two forecasts based on the product process, 
namely forecasts 2 and 3, the best is forecast3. Theaverage of the estimates 
RMSE(V, 3) is 0.47, 10 estimates are less than 0.3, 7 are between 0.3 and 0.5, 


108 Modelling financial time series 


13 are in the range 0.5 to 0.7 and 9 estimates exceed 0.7. Using the 
alternative way to estimate À gives a lower average for the estimates 
RMSE(¥, 3) equal to 0.32; however these RMSE estimates could be seriously 
biased if the autocorrelations of V, are not close to ó * for some ф. 

The forecasts %., are clearly far from perfect. Nevertheless they are very 
useful as will be shown in Chapter 5 and subsequently. 


Two leading forecasts 


The best forecast based on a stationary model for returns is the third 
forecast. It uses ó and ó estimated directly from the autocorrelations of the 
absolute returns m, and will be called the stationary forecast. Forecasts 6 
and 7 are suitable for non-stationary variances and their RMSE figures are 
very similar. For the remainder of this section results are given forthe sixth 
forecast, now called the non-stationary forecast. 

Table 4.1 presents RMSE(m, 3) and RMSE(m, 6) for each series. Although 
the non-stationary forecast has a slightly lower average RMSE it is notice- 
able that the stationary forecast is better than its rival for most series; 
forecast 3 outperforms forecast 6 for 25 of the 39 long series. The differ- 
ences between the RMSE values of the stationary and non-stationary 
forecasts are usually very small because the forecasts are often similar 
when ф is close to 1 fora stationary model. To gather further insight into the 
issue of stationarity we consider forecasts further into the future. 


More distant forecasts 


The stationary forecast was deduced by considering an ARMA(1, 1) process. 
For such processes the optimal predictions for times t + 1and t + N, both 
made at time t, are related by a simple equation. From (1.9.16) it can be 
deduced that the best forecast of M,,,, made at time t and denoted М, y, 
satisfies 

М.м = Шм = ФМ шм), (4.4.3) 
for all N > 1 апа М, 1 is just Ka. Replacing the parameters ф and py, by 
their usual estimates gives the following equation for the stationary fore- 
cast, N days hence: 


rhy — у, = ó (rh, — fip). (4.4.4) 

The non-stationary forecast is the result of setting ф = 1 in the АКМА(1,1) 

model. This indicates that the appropriate non-stationary forecast N days 
hence is simply rfi for any N. 

For large N the stationary and non-stationary forecasts will often be quite 

different because of the factor ó `! in (4.4.4). Defining MSE(m, j, N) and 
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Table 4.1 Relative mean square errors for forecasts of absolute returns 


Forecasts one dayahead Forecasts ten days ahead 
Forecast 3 Forecast6 Forecast 3 Forecast 6 

Spot series 

Allied 0.908 0.910 0.939 0.937 
Alcoa 0.914 0.918 0.940 0.941 
Am. Can 0.949 0.951 0.963 0.967 
ATandT 0.954 0.967 0.993 0.993 
Am. Brands 0.966 0.954 0.995 0.976 
Anaconda 0.960 0.955 0.998 0.969 
Bethlehem 0.954 0.960 0.979 0.977 
Chrysler 0.940 0.951 0.964 0.971 
Dupont 0.855 0.852 0.889 0.879 
Kodak 0.864 0.858 0.912 0.903 
G. Electric 0.838 0.777 0.919 0.802 
G. Foods 0.909 0.884 0.965 0.900 
G. Motors 0.916 0.878 0.982 0.892 
G. Telephone 0.930 0.895 0.937 0.906 
Harvester 0.931 0.935 0.949 0.951 
FT 30 0.941 0.949 0.973 0.996 
Gold 0.915 0.930 0.985 1.118 
Silver 0.756 0.687 0.910 0.745 
Copper 0.851 0.856 0.891 0.908 
Lead 0.855 0.884 0.914 0.983 
Tin 0.883 0.895 0.999 1.051 
Zinc 0.961 0.972 1.036 1.075 
£/$ 0.883 0.903 1.009 1.058 
Futures series 

Corn (12) 0.614 0.605 0.667 0.635 
Corn (6) 0.574 0.555 0.659 0.604 
Corn (3) 0.565 0.547 0.637 0.585 
Cocoa (12) 0.835 0.840 0.910 0.943 
Cocoa (6) 0.831 0.837 0.871 0.915 
Cocoa (3) 0.907 0.923 0.922 0.950 
Coffee (12) 0.878 0.901 0.934 0.984 
Coffee (6) 0.744 0.768 0.813 0.864 
Coffee (3) 0.728 0.746 0.801 0.851 
Sugar (12) 0.764 0.771 0.818 0.835 
Sugar (6) 0.746 0.751 0.796 0.814 
Sugar (3) 0.750 0.752 0.817 0.833 
Wool (12) 0.653 0.651 0.668 0.668 
£/$ (6) 0.874 0.873 0.891 0.889 
ОМ (6) 0.866 0.875 0.901 0.927 
SF/$ (6) 0.927 0.935 0.945 0.960 


T-Bond 0.928 0.850 0.927 0.844 
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RMSE(rhñ, j, N) in the obvious way, by generalizing (4.3.9) for forecasts N 
days into the future, we should expected the absolute magnitude of 
RMSE(rñ, 3, N) minus RMSE(/fi, 6, N) to increase as N increases. This is 
observed for most series. Table 4.1 gives the RMSE values for stationaryand 
non-stationary forecasts when N = 10. The average RMSE of the stationary 
forecasts is now less than that of the non-stationary ones. 


Conclusions about stationarity 


From Table 4.1 it is clear that stationary models give the best forecasts for 
the UK share index, the metals excepting silver, and the UK agricultural 
futures, whilst non-stationary models are indicated for the US stocks, 
silver, and US corn futures. Currencies, spot or futures, are slightly better 
forecast by stationary models. Forecasting results for N = 25 entirely 
supportall these conclusions. 

These forecasting results cannot prove that stationary models are cor- 
rect. It should be noted that two futures series were proved non-stationary 
in Section 2.4 whilst the forecasting results suggest using a stationary 
model. Nevertheless, it is concluded that stationary models can provide a 
good approximation to the true stochastic process generating financial 
returns at various markets even when the period studied covers many 
years. 


Another approach 


All the results given here are for forecasts of т), at some future time 
t+ N. It is possible to rework the analysis in terms of forecasts for 
бақ = m2 м and to use the result that forecasts of s, and vf +n ought to be 
identical. This has been done for N — 1 but the improvements in forecast 
accuracy upon the benchmark forecast are usually less than those reported 
for the forecasts rh). For example, the average RMSE for rfi/?, is 0.848 
compared with an average of 0.905 for the corresponding forecast $5, 


defined by replacing each letter m by a letter s in (4.3.1) and (4.3.4). 


4.5 RECOMMENDED FORECASTS FOR THE NEXT DAY 


To predict tomorrow's conditional standard deviation v,,,, we must first 
choose a method for forecasting m;,,. The results just presented show 
several methods have almost identical accuracy, when measured by RMSE. 
Forecasts given by exponentially weighted moving averages (EWMA) are 
fractionally more accurate than the other forecasts and have two further 
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advantages. Firstly, the EWMA method requires less parameter estimates 
because ф is always 1 and so neither ф nor им need to be estimated. 
Secondly, if for any reason the unconditional variance changes, introduc- 
ing non-stationarity into the returns process, then the forecasts can reflect 
the variance change fairly quickly and accurately. These advantages of the 
non-stationary forecast are particularly relevant for real-time applications. 
When forecasting more than one day into the future it is less clear which 
forecast should be used. This is discussed more fully in Chapter 9. 

It is recommended that v,,, is predicted using an EWMA forecast for 
тыл, determined by a smoothing parameter y as follows: 


Vua = 41/6 


y $ (1 – у) Íx — x|/6 
i=0 


(1 — y), + ylx, = x|/8. (4.5.1) 


Implementing equation (4.5.1) requires values for ó, y, and x. There is no 
reason to doubt the assumption that returns have conditional normal 
distributions and thus ó = 0.798. For research studies, x can be taken to be 
the average return during a complete time series and this will be done in 
Chapters 5, 6, and 7. For real-time applications, it is time consuming to 
calculate x every day from the available returns. Simply setting x equal to 
zero іп (4.5.1) makes very little difference to the forecasts %;). This 
simplification is used in Chapter 8 when assessing trading rules and has 
appeared in previous publications by the author. 

Avalue y fora particular series could be found by retrospective optimiza- 
tion as described in Section 4.3. However, the average RMSE figures given 
in Section 4.4 show that using y — 0.1 for all series was as accurate as 
seeking an optimal y for each series. The best y for past data may not be 
optimal in the future because the RMSE of EWMA forecasts changes 
relatively little as y varies between 0.02 and 0.2. This explains why y — 0.1 
often gave better results than values obtained from optimization over the 
initial two-thirds of the returns. Back-optimized values of y have been 
found for each complete series by minimizing the criterion 


> ima = rh, 4 QDE 


summing over t from 21 to n — 1 inclusive. Тһе best values are recorded in 
Table 4.2. 

Based on these values and the realization that the best past y is not 
necessarily the best у for future forecasts, the values 0.04 for stocks and 0.1 
for currencies and commodities are recommended. These values are used 
in the remainder of this text. 
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Table4.2 Bestsmoothing constants y (back-optimized) 


Spotseries y Futures series y 
Allied 0.05 Corn (12) 0.09 
Alcoa 0.05 Corn (6) 0.09 
Am. Can 0.03 Corn (3) 0.10 
ATandT 0.03 Cocoa (12) 0.10 
Am. Brands 0.04 Cocoa (6) 0.09 
Anaconda 0.04 Cocoa (3) 0.08 
Bethlehem 0.03 Coffee (12) 0.11 
Chrysler 0.03 Coffee (6) 0.14 
Dupont 0.06 Coffee (3) 0.19 
Kodak 0.06 Sugar (12) 0.11 
G. Electric 0.06 Sugar (6) 0.10 
G. Foods 0.03 Sugar (3) 0.11 
G. Motors 0.04 Wool (12) 0.10 
G. Telephone 0.03 £/$ (6) 0.07 
Harvester 0.04 DM/$ (6) 0.09 
FT 30 0.06 SF/$ (6) 0.08 
Gold 0.17 T-Bond 0.04 
Silver 0.09 

Copper 0.09 

Lead 0.17 

Tin 0.13 

Zinc 0.12 

£/$ 0.11 


Table 4.3 Frequency distributions for forecasts of the conditional standard deviation. 
Percentage frequencies (F) and percentage cumulative frequencies (CF) 


Allied Gold Sugar (12) £/$ (6) 
Range F CF F CF F CF F CF 
0to0.005 0 0 3.7 3.7 1.2 1.2 583 58.3 

0.006 to 0.010 3.6 36 298 335 208 220 395 9.8 
0.011to 0.015 43.5 471 256 591 25.4 47.4 2.2 100 
0.016to 0.020 35.0 821 204 795 17.9 653 
0.021to 0.025 110 93.0 104 89.9 112 76.5 
0.026to 0.030 47 97.7 3.7 93.5 7.5 84.0 
0.031 to 0.035 1.8 99.5 2.0 95.6 5.8 89.8 
0.036 to 0.040 0.5 100 1.4 97.0 3.5 93.3 
0.041 to 0.045 0.9 97.9 25 95.8 
0.046 to 0.050 0.7 986 0.9 967 
More than 0.050 1.4 100 3.3 100 
Minimum 0.007 0.003 0.003 0.001 
Median 0.015 0.013 0.016 0.004 


Maximum 0.038 0.069 0.084 0.011 
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Figure 4.1 Gold prices (solid curve) and forecasts (dashed curve) of the return's 
conditional standard deviation from November 1979 to March 1980. Dashed curve 
is 10000 v, , , 


Examples 


Table 4.3 summarizes the recommended forecasts %;| for four series: 
Allied (the first US stock series), spot gold, sugar futures, and sterling/dollar 
futures. All four frequency distributions are highly right-skewed. This 
shows that there are occasional periods of time when the conditional 
standard deviation far exceeds its usual average value. Prices change by 
exceptional amounts during these periods. 

Figures 4.1 and 4.2 illustrate considerable changes in the forecasts, firstly 
for gold and secondly for sterling/dollar futures. Both illustrations are 
exceptional. Figure 4.1 shows 100 gold prices and forecasts centred on 18 
January 1980 when the closing price was a record high at $835. The 
conditional standard deviation remains high after the record price indicat- 
ing considerable uncertainty about future prices caused, according to 
popular opinion, by political problems involving Afghanistan. Figure 4.2 
shows sterling/dollar prices and forecasts for the June 1976 contract be- 
tween December 1975 and May 1976. After months of steady trading 
between $1.97 and $2.01, the price fell in one day from $1.9920 to $1.9425 
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Figure 4.2 Prices of sterling futures (solid curve), іп US dollars per pound, and 
forecasts of the return's conditional standard deviation (dashed curve) from 
December 1975 to May 1976. Dashed curve is 100 V,+ 


and then continued to fall. The forecast V, quickly moves from about 
0.0013 to more than 0.005. This sudden increase in price volatility is very 
much a special case for currency futures. 


4.6 SUMMARY 


Reasonable forecasts of the relevant standard deviation for the next return 
can be calculated from a weighted average of the absolute values of past 
returns. It will become clear in Section 5.8 that the forecasts have research 
applications: they are sufficiently accurate to help define much better 
estimates of the autocorrelation among daily returns. The forecasts also 
have practical applications: they provide a very convenient risk measure 
for traders and require minimal computational effort. 

The issue of stationary versus non-stationary models is not important 
when forecasting the next conditional standard deviation and the cautious 
strategy of selecting a non-stationary method is recommended. Forecasts 
ten or more days intothe future showthatstationary models are acceptable 
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for some series, particularly commodity series. It does appear that station- 
ary models can have important applications, discussed later in Chapter 9 
about the pricing of options. When a stationary model is applicable a 
product process gives a better approximation to the true returns process 
than does an ARCH process. 

Therefore two (or more) innovations are required per unit time if the 
returns process is to be modelled using independent and identically 
distributed innovations. 


Chapter 5 


The Accuracy of Autocorrelation Estimates 


The forecasts of the conditional standard deviations derived in Chapter 4 
can be used to construct appropriate series for autocorrelation tests of the 
random walk hypothesis. This chapter shows that the returns series is 
usually not appropriate but returns divided by their forecast standard 
deviations define a satisfactory series for random walk tests. 


5.1 INTRODUCTION 


Suppose n returns x, are used to calculate some autocorrelation 
coefficients r,,. Let r,, be the realized value of a random variable А, х 
and recall from Section 1.10 that standard tests based upon the approxi- 
mation var (№, х) = 1/n are actually joint tests for independence and 
finite-variance, identical distributions. This joint hypothesis was con- 
clusively rejected in Section 2.10. Of far more practical interest are tests 
of the hypothesis that returns have constant mean and are un- 
correlated, the definition of the random walk hypothesis used throughout 
the remainder of this book. The result var (А, х) = 1/n is not implied by 
the random walk hypothesis and thus standard tests, based upon an 
assumed variance 1/n, can be unreliable. This is demonstrated in 
Section 5.2 by considering extreme examples of non-linear, uncorrelated 
processes. 

A method for assessing the true variance of an autocorrelation 
coefficient is described in Sections 5.3 to 5.6 and evaluated for the 
forty time series in Section 5.7. The true variances are generally far greater 
than 1/n. Autocorrelation coefficients should have smaller variances if 
they are calculated from returns divided by their estimated conditional 
standard deviations. This rescaling method is described in Section 5.8 
and its usefulness for obtaining better autocorrelation estimates is 
assessed in the rest of the chapter. Random walk tests are presented in 
Chapter 6. 
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5.2 EXTREME EXAMPLES 


To illustrate the fact that n[var (А, x)] is not always approximately 1 for an 
uncorrelated process {X,}, we consider two examples based on the models 
introduced in Section 3.2. Firstly, suppose (X,) is constructed from a 
zero-mean, strict white noise process {U,} by: 


X, = o, U, for 1 < t < m, 
= aU, form + 1 < ten. 


Ignoring the sample mean X, the coefficients are 


Rx = > Xx. È Xt: (5.2.1) 
t=1 


Keeping о fixed at a positive number, let о; — 0. Then 


Rise SS ио. Ў 02. (5.2.2) 
t=1 t=1 

These limits are the autocorrelations of {U,} based on m observations. 

Such autocorrelations have var (А, u) = 1/m. Thus nivar (А, х) —^ n/m 

approximately. The ratio n/m can be arbitrarily high. 

Secondly, suppose (X,) is a stationary and uncorrelated process defined 
by X, = WU, with V, = оу or o; and (М) a Markov chain, stochastically 
independent of zero-mean, Gaussian white noise {U,}. Define the tran- 
sition matrix of the Markov chain by 


P(V, = m| V = а) =1- А, [= 1,2 
and assume 0 < Aj, А, < 1. Then if P(V, = a) is А›/(А + Az) = ті, Say, it 
will follow that P(V, = a) is z, for all t. As c; > 0,aresult like (5.2.2) can be 
obtained, except now the numbers of relevant times appearing in the 
numerator and denominator of K, x are two random variables. It can be 
proved that 


nivar (А, „)} > (1 — A?)/m (5.2.3) 


for sufficiently large n and small ratio с/о; . The proof is omitted. It follows 
from a general result proved in Section 5.5. The lower bound in (5.2.3) can 
be made arbitrarily high; for example, it is more than 8 when A, — 0.09, 
А, = 0.01 and so 7 is 0.1. 

In both of our extreme examples, several observations x, are irrelevant 
when the sample autocorrelations are calculated. The autocorrelations are 
therefore determined by less than n observations and so the variance of an 
autocorrelation coefficient becomes more than 1/n. Much the same situa- 
tion occurs for autocorrelations calculated directly from financial returns. 
Because of changes in variance or conditional variance, some observations 
are less relevant than others when autocorrelations are calculated. 
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A rather extreme illustration of this point can be given by considering the 
series of wool returns. Let s? be the variance for contract /, calculated from 
n; returns. There are 13 contracts, n, = 270 and n, = 250 for í > 2. The 
denominator of the autocorrelations is È (x, — x)? and this will approxi- 
mately equal X n,s;. It can be verified from Table 2.5 that, for the two years 
of highest variance, n,s¿ + n;s? = 0.37 but for the other eleven years the 
total of the products n,s? is only 0.19. Thus the observations x, during two 
years are far more relevant than all the others in determining X (x, — x)°. 
The same conclusion is valid for X (x, — x)/(x,,, — x) and hence also 
for r, y. 


5.3 A SPECIAL NULL HYPOTHESIS 


Our objective is to estimate the variance of R, x under the null hypothesis 
that {X,} is an uncorrelated process. The process {X,} could be non- 
stationary and it is therefore desirable to make as few assumptions as 
possible about the generating process. We will assume the X, have identical 
means, E[X,] = д. Initially we suppose и is known and then, without loss 
of generality, ш can be assumed to be zero. In Section 5.7 the equations 
defining the estimates are revised for the more realistic situation of a 
constant yet unknown mean. 

To obtain estimates, we make the assumption that if {X,} is uncorrelated 
then the multivariate density of any n random variables is symmetric. Let 


f(x,, X», ..., Xn) denote the multivariate density of the random variables X,, 
1 < t < n, let x denote the vector (хі, х, ..., Xn) and define |x| to be the 
vector (|x,|, |x; |, ..., |x, ). Then multivariate symmetry means the likeli- 


hood or probability (respectively for continuous or discrete distributions) 
of x is always the same as that of |x|; remember we are assuming u = 0. 
Multivariate symmetry defines the special null hypothesis 


H; : f(x) = f(|x|) forall vectors x. (5.3.1) 
When H;istrue, the bivariate density of X, and X,,,, denoted p(x,, x;,,), has 
DO, Xtar) = р(х, Xit) = p(—x,, х) = p(—x,, 7X4). 


Consequently, it can be verified that 


| | хр, X4,) dx, dx, = 0 


and so H; implies 
Ны: X, and X,,, are uncorrelated whenever т > 0. 


Obviously H, does not imply Hs. However, it is reasonable to assume that 
any acceptable model satisfying H, will also satisfy Hs. As noted previously 
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in Section 3.3, almost all the researchers who have studied realistic 
uncorrelated processes assume {U,} is Gaussian white noise. It then 
follows that Н; is true because firstly the U, are independently distributed 
and secondly the normal distribution is, of course, symmetric. 


5.4 ESTIMATES OF THE VARIANCES OF SAMPLE AUTOCORRELATIONS 


As the expected value of every X, is assumed to be zero, the sample 
autocorrelations can be defined for n observations by 


= - > ame f х2. (5.4.1) 


There is а negligible difference between this definition апа the опе 
adopted earlier when the expected value of X, is unknown (equation 
(2.9.1)). The sample autocorrelations are the realizations of the random 
variables R, x = E X,X,,,/Z X;. Leta, = var (R, х). Then we are interested in 
estimating о, using only the n observed returns and the assumption that 
the multivariate density is symmetric. 

The parameters o, depend on the multivariate distribution of the X, 
about which we have attempted to assume as little as possible. Given the 
observations x, (1 < t < n) and assuming H; is true, all that can be deduced 
about the density function is the following statement: every sequence xt, 
1«txn,forwhich each xf is either x, or —x, has equal likelihood, namely 
the likelihood of the observed data. There аге 27 such sequences if all the 
x, are non-zero: there are less if some zero returns are observed. 

Each sequence could have occurred and caused a particular realization 
of R, x. These equiprobable sequences thus provide a discrete conditional 
distribution for R, x. This is the distribution of R, x given that |X| = |x|, 
1 = t < n. Thevariance of this conditional distribution is later shown to be 


п-т п 2 
=> ыр x) : (5.4.2) 
еп = 


The number а, can and will be used to estimate о, . This estimate is unbiased 
when Hj is true, i.e. if a, is an observation of A, = > X2X1,,/(Z X2) then 
E[A,] = o,. There сап be some bias if H; is false. Its magnitude and the 
interpretation of a, in these circumstances are discussed in Section 5.6. 

The a, are the only obvious estimates of the parameters o, when the only 
assumption about the X, is multivariate symmetry. No alternative estimates 
are known. Other estimates might be obtained if further conditions about 
the X, are assumed. However, it is very difficult to select realistic extra 
conditions. The remainder of this section covers the derivation of a, and 
then shows there is no bias if Hs is true. Readers wishing to avoid further 
mathematical results should now move on to Section 5.6. 
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Let m, = |x] and M, = |X| for 1 < t < n and denote by m and M 
the vectors (m, My, ..., Ma) and (Mi, М;,..., Mn), respectively. It will 
be assumed that the X, have continuous densities so there is zero 
probability of observing x, — 0; itis straightforward to adapt the following 
algebra for discrete distributions having x, = 0 as a possible outcome. 
Assuming continuous densities, discrete random variables т, can be 
defined by X, = 7, M, and the only possible realized values of т; are 1 and 
—1. The probability of any feasible realization of = (ту, т, ..., 7,) is ап 
n-dimensional integral of the density function of x = (ху, х,..., Xp) 
over a region (x, > 0, all je J, x, < 0, all k є J with 1 < k < n), for some 
set J. Assuming H; is true, all these integrals and probabilities equal the 
same integral over the region (x; > 0,1 < j < n}. Consequently, ther, are 
independently and identically distributed with Pim, = 1) = P(a, = —1) = i 
when H; is true. Using this conclusion, var (R. 4|M = m) can be 
calculated. 

Writing R, х as Хтут,,,/М,М,,,/5 М?, 


FIR, x| m] => mm Ет. У, т? = 0 
because E[7,75,,] = 0 forall t and all т > 0. Thus 

var (К, x| m) = ЕК? x|m] = ELS тта тт.) m2)2. 
As ҒІіт,т,,,тұт,, 1 = 0 for s # t and 7 > 0, and F[m?72,,] = 1, it follows 
that when H; is true: 

var (R. x|m) = > mimi. (> m2)2. 

t=1 

This conditional variance is the estimate a, given in equation (5.4.2). Also, 
if A, = E М%М?,,(> М2), then for any m: 

E[A,|m] = Е[К? x|m]. 


So, unconditionally, E[A,] = E[R2 4] = var (К) = а, and hence a, is an 
unbiased estimate of a,, again assuming H; is true. 


5.5 SOME ASYMPTOTIC RESULTS 


The derivation of the estimates a, does not require the process (X,) to be 
stationary and the estimates should be useful for investigations of non- 
stationary data. It will be assumed in this section that (X,) is stationary and 
has finite kurtosis. Asymptotic limits can then be derived for the estimates. 
These limits indicate the probable magnitude of the estimates for various 
uncorrelated processes and also allow us to assess thesize of any bias when 
the process is not uncorrelated. 
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Suppose now that a,,, denotes the estimate of var (А, х) based on n 
observations, given by equation (5.4.2), and let b, , = n(a, n). Then 


b,n = 1 Sy x22 lee мү 
T,n n = ESFIS n s t 


and therefore, by the law of large numbers, 
b, n > B, = EIX XG VEG? (5.5.1) 


as n — о. A more convenient formula for B, can be found by applying the 
result that cov (X?, X2,;) equals E[X?X2,;] — E[X?]’, firstly with í = 7 and 
secondly with í = 0. This gives us the result that 8, depends on the kurtosis 
of the X,, denoted ky, and the autocorrelations of S, = X?: 


B, = 1 + (ky — Dp, s. (5.5.2) 


Clearly, B, = 1 forany strict white noise process, even if itis not symmetric. 
Also, when Hs is true, 


nivar (K, x)] > В, 


as n — о, because all the estimates а, „ are unbiased. We should expect 
the p, ; to be positive for returns and to decline towards zero as rincreases. 
The B, are then always more than one and decline towards one for large r. 


Linear processes 


If X, is a linear combination of the present and past terms from a zero-mean, 
Gaussian, white noise process (&), then ky = 3 and p, 5 = p? x (Granger 
and Newbold, 1976) and hence 8, = 1 + 2р2. When, however, the e, have 
anon-Gaussian distribution, the results about p, sin Section 2.10 show that 


B. = 1 + 2p? + (ky — 3)c, (5.5.3) 


for certain non-negative c, having a small total, i.e. X c, is small summing 
over positive r. 

Assuming {X,} is not strict white noise, Hs is false and the estimates a, can 
be biased. For example, if (X,) is a linear AR(1) process with autocorrela- 
tions é *, from Kendall and Stuart (1976, p. 472) we obtain: 


nivar (&,,)] > (1 + ф?)(1 — $?*y( — p?) — 27677 ер, 


Now if $ = 0.1 the correct asymptotic limit of n[var (А, x)] increases from 
0.99 (for + = 1) to 1.02 for large т. By comparison, B, = 1.02 and B, = 1for 
large r when the process is Gaussian. In this example X c, = ¢* = 0.01 so 
the term (ky — 3)c, in equation (5.5.3) will be negligible. We see that the 
bias in B, (and hence estimates a,) is very small for a linear, AR(1) process 
having a sensible, low autoregressive parameter. 
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Non-linear processes 


When (X,) is the product of stochastically independent, stationary pro- 
cesses (М) and {U,}, with our usual conditions on the mean and variance 
of U,, equation (5.5.1) shows that 


B. = КШ U (EL V2 V2, ШЕГУДЕ). (5.5.4) 


For strict white noise {U,}, the expectation outside the curly brackets is 
always 1 and the B, are determined by the mean, variance, and autocorrela- 
tions of (V2: 


B, = 1 + p, (var (V2)/E[V?]). 


For the special case when {log (V,)) is a Gaussian, AR(1) process with 
variance о? and autocorrelations ó, equations (3.5.6), (3.5.7), and (3.5.10) 
can be substituted into (5.5.2), to show that 


B, = exp (407% 7. (5.5.5) 


Some of the estimates of ø presented in Section 3.9 exceed 3 and generally 
¢ is estimated near 1, thus the В, could exceed e = 2.7 . . . for low т. 

When {U,} is not strict white noise, equation (5.5.4) shows that the 
autocorrelation in the process causes В, to be the figure for a comparable 
uncorrelated process multiplied by £[U7U7,,]. This multiplicative factor is 
1 + (ky — 1)р, u;, ky denoting the kurtosis of the U,. The factor will be very 
small. It would be 1 + 2р, u: for a Gaussian, linear (U,), then bounded 
below by 1 and above by 1.02 when |p, u| < 0.1. 

This multiplicative factor suggests a way to assess the bias in B, for a 
general autocorrelated process (X). Bias is caused by the autocorrelation 
in {U,}, so let ó, be the limit of n [var (K. u)] minus the factor 1 + (ky — 1)p, u2- 
Then 6, is the bias for process (U,) and it will be small; see the preceding 
sub-section about linear processes. Then the bias 


nivar (А, x)] — В, 


as n — о, will be approximately à,E[VZV2,,/E[V7I? from (5.5.4), i.e. 
approximately ó,, . 

For the uncorrelated ARCH process introduced in Section 3.6 the limit B, 
can only be derived for very special cases because it is difficult to find the 
kurtosis ky. Fora first-order ARCH process (equation (3.6.3)) with p, ç = $” 
and $? < 1/3, itis easy to show 


B, = 1 2ф7(1 — 3ф?). (5.5.6) 


Note that in both of equations (5.5.5) and (5.5.6) changes in conditional 
variance can cause the B, to be arbitrarily large. In each case the B, decrease 
monotonically towards 1 as 7 is increased. 
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5.6 INTERPRETING THE ESTIMATES 


Empirical estimates of the variances of autocorrelation coefficients are 
given in the next section. First, however, we clarify the interpretation of an 
estimate b, = n[a,] of n[var (К, х)]. 

Consider the following hypothetical example: 2500 returns yield 
rj, = 0.05 and b, = 2.25. Then assuming А; x has variance 1/n we would 
reject the hypothesis of strict white noise at a 5 per cent significance level, 
since r, х is more than 1.96/Vn = 0.04 approximately. However, Hs could 
well be true with А; x having variance2.25/n = 0.03? because r, ,/0.03 is only 
1.67. Thus we would not be willing to reject Hs nor the hypothesis H, of 
zero autocorrelation using ñx. There might, however, be a series of 
rescaled returns { y,) defined ina manner that does not introduce spurious 
autocorrelation. This series could have r, = 0.045 and an estimate b+ of 
n[var (К; у) equal to 1. Then we would be happy to reject Hs and also Hg at 
the 5 per cent level whenever the estimate b* is considered reliable. We 
define satisfactory series {y,} in Section 5.8. If you jumped Section 5.5 you 
are advised to read the next paragraph and then jump to Section 5.7. 

The estimates a, have been derived by assuming Hs is true. Researchers 
often want to test for zero autocorrelation. Thus it is not known, before 
testing, whether or not H; is true. How should b, be interpreted in these 
circumstances? The recommended answer has two parts. Firstly, b, is 
either unbiased or has negligible bias and, secondly, if the b, are far more 
than 1 then random walk tests based on the assumption var (К, х) = 1/nare 
unreliable. 

To motivate this answer, consider some of the asymptotic results 
recently obtained for stationary processes. When Hs is true, b, is an 
unbiased estimate of n[var (K. x)]. If, instead, Hs; is false, then b, can be 
biased but the proportional bias, i.e. 


{E[B,] — nivar (К, )])/(n[var (А, 40] 


will always be small for long series (b, is the observed value of В,). Тһе 
asymptoptic proportional bias was shown in Section 5.5 to be 3 per cent or 
less for all т when {X,} is Gaussian and AR(1), with maximum auto- 
correlation p, у = 0.1. In practice the proportional bias could be higher but 
it is very unlikely that it could exceed 10 per cent for a sensible model for 
daily returns. The issue of bias is therefore not very important when 
estimates b, are considered. 

Any test of the random walk hypothesis involving autocorrelation co- 
efficients requires the variance of coefficients when the null hypothesis of 
zero autocorrelation is true. Suppose X, = V,U, and {U,} is almost strict 
white noise, so, as in Section 3.7, U, = T, + e, with (e) strict white noise 
and var (7,) < var (ej). Then equation (5.5.4) shows, for the product of 
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stochastically independent and stationary processes, that the asymptotic 
estimates В, are almost identical for (a) the process (X,) and (b) the 
uncorrelated process (X7) defined by X7 = Мг). Thus, even if Hs is false, 
we can expect a, to be an appropriate estimate of the relevant sample 
variance when the null hypothesis is true. So, if the b, are significantly 
greater than 1 itis always inappropriate to attempt random walk tests based 
on the assumption var (R, x) = 1/n. 

The conclusions in the two previous paragraphs are deduced from 
results proved for stationary processes. It is possible to satisfy ourselves 
that these conclusions remain valid if (Vj) is not stationary and {U,} is 
approximately strict white noise. 

Estimates b, can have high standard errors, discussed further in Section 
5.9. Consequently the b, should be calculated for several lags 7 and a set of 
estimates considered. Fortunately when this is tried the conclusions are 
very incisive. 

When b, is clearly more than 1 the significancelevel of atest involving R, x 
will be underestimated. This can be very serious. For example, suppose 
that (ХУ) is uncorrelated and R, x ~ N(0, 2/n) but we use a 5 per cent 
significance level and falsely assume А, х ~ N(0, 1/n). Then the chance of 
rejecting the random walk hypothesis, using R, x for the test, is 


Р(|2,| > 1.96/\V/n), with Z, ~ NO, 2/n), 
and this probability is 

P(\Z,| > 1.96//2), — with Z, ~ N(O, 1) 
which equals P(|Z,| > 1.386) = 17 per cent. 


5.7 THE ESTIMATES FOR RETURNS 


The estimates а, of a, = var (А, x) derived in Section 5.4 apply the assump- 
tion that the means of the X, are all zero. For a known constant mean u, all 
terms x, in (5.4.2) would be replaced by x, — д. To calculate empirical 
estimates of the autocorrelation variances, all terms x, are replaced by 
x, — X. The estimate b, of n[var (А, x)| then becomes 


п- т п 2 
b, = п Y (х, — X)2(x,,, — ТЕ (x, — еу) (5.7.1) 
t=1 
For futures series the numerator summation in (5.7.1) is restricted to those 
times t for which x, and x;,, are returns from the same contract. 

Estimates b, are presented in Table 5.1 for lags 7 from 1 to 5 days for all 40 
series. Many estimates are greater than 2. Several commodity estimates 
exceed 3. Figures 5.1, 5.2, and 5.3 display the estimates up to lag 30 for the 
first share series, the gold series, and the longest sugar futures series. 
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Figure 5.1 Estimates of the variances of autocorrelation coefficients multiplied by 
the number of observations for the first US stock series. Estimates b, for returns (+) 
and b* for rescaled returns (X) 


These estimates tend to decrease as 7 increases and so they are fairly 
consistent with the general asymptotic results of Section 5.5 and the special 
result (5.5.5). 

Stocks generally have lower estimates than currencies or commodities. 
This is made clear by the following medians and means. 


Median estimate Mean estimate 
Lag Lag 


1 2 3 4 5 1 2 3 4 5 


15US stocks 1.81 1.43 1.43 1.47 1.36 1.79 1.46 1.47 1.43 1.44 
4currency series 2.02 1.62 1.50 1.48 1.59 2.01 1.66 1.55 1.56 1.70 
19 commodity series 3.06 3.14 2.83 2.49 2.57 4.13 3.42 3.08 2.85 2.90 


The minima of these summary statistics are 1.36 for the stocks, 1.48 for the 
currencies, and 2.49 for the commodities. Even if the actual variances of 
coefficients R, х were as low as these minima, divided by n, it would be 
necessary to change the standard methodology used in random walk tests. 
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Variance estimate 


Time lag in days 


Figure 5.2 Estimates b, and bš for gold returns (+) and rescaled returns (х) 


The estimates b, prove conclusively that the autocorrelation coefficients 
of daily returns have variances greater than 1/n. Therefore random walk 
tests should not assume the variances of coefficients equal 1/n. A higher 
variance is needed when testing for zero autocorrelation in the returns 
process by examining the sample autocorrelations. Using 1/n for the 
variances is only acceptable when testing for independence and identical 
distributions, i.e. strict white noise (SWN). However, we saw in Section 
2.10 that the SWN hypothesis is false for all series. 


5.8. ACCURATE AUTOCORRELATION ESTIMATES 


The high variances of conventional autocorrelation coefficients are almost 
certainly caused by the non-constant conditional variances of the returns. 
This is shown to be a possible explanation by various equations in Section 
5.5, including (5.5.5). To obtain accurate coefficients we need a series 
possessing a reasonably homogeneous variance. Ideally this would be the 
series {u,} in which u, equals (x, — &)/v,. As u, is the realization of exact or 
approximate strict white noise {U,}, coefficients calculated from the {u;} 
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Figure 5.3 Estimates b, and b? for sugar futures returns (+) and rescaled 
returns (X) 


would have orthodox variances, i.e. var (А, џ;) = 1/п. Of course, the 
realized conditional standard deviations v, are not observable so u, cannot 
be calculated. 


Rescaled returns 


An approximation to the standardized return u, = (x, — &)/v is given by 
substituting estimates for и and v,. Using X for апа a forecast ú, made at 
time t — 1 gives the rescaled return defined by 


y, = уу, (5.8.1) 


t 
= u,lv,/%). 


The rescaled return will be similar to the unobservable standardized return 
whenever v, is a good forecast. Removing x from (5.8.1) and the calculation 
of ¥,, as in the author's previous publications, produces almost identical 
results. 

The forecast ú recommended in Chapter 4 is always used in the calcula- 
tion of y. Initially, 
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20 
(4 = 1.253 У |x, — x|/20 (5.8.2) 
t=1 


and thereafter 
Ú = (1 — у)й— + 1.253y| x1 — x]. (5.8.3) 


For stock series y = 0.04 and for all other series y = 0.1. Futures series 
need initial forecasts for each contract. For contract i, contributing returns 
at times 7; + 1 to 7, inclusive to a time series {x}, the twenty returns on 
contract í from times 7, | — 19 to 7; , are used to calculate the initial 
forecast (i > 1). 

It is recommended that autocorrelation coefficients are calculated from 
the series of rescaled returns {y,}. These coefficients are 

n-T = п 
ту = >, (y, У) (унт — p 5, (y, — yx. (5.8.4) 

Times before 21 are left out of (5.8.4) because forecasts V, are not attempted 
for such times. The coefficients r,, are now shown to have acceptable 
variances, i.e. if n* = n — 20 then var (К, y) = 1/n*. 


Variance estimates for recommended coefficients 


The estimated variances of the coefficients for returns {x,} were obtained 
by assuming the multivariate symmetry hypothesis H;. When Hs is true for 
{X,} it will also be true for the process { Y,} generating the rescaled returns 
{ у), if we ignore the difference between x and ш. This conclusion follows 
from the definition of v,; it is a function of past returns which does not 
depend on the signs, positive or negative, of x, ; — X, i > 0. Consequently, 
the variances of autocorrelation coefficients calculated from rescaled 
returns can be estimated by the method applied to the returns. 

Consider r, у to be the realized value of К, у. Then the estimate Б? of 
п*[уаг (А, y)] is 


bt = (n — 20) Y (y — V) Ye — А > iue д (5.8.4) 
t=2 t=21 

The first twenty returns are needed to obtain the initial V, and hence the 
summations in (5.8.4) begin at t = 21. Table 5.1 presents the estimates b* at 
lags 1 to 5 for comparison with the estimates b,. АП but five of the 200 
tabulated estimates b, are greater than the corresponding estimate b7. 
Therefore r, , is expected to be a more accurate estimate than r, ,. Figures 
5.1, 5.2, and 5.3 compare b, and 6$ graphically up to lag 30. 
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Table 5.1 Estimates of n[var (R.)] for returns and rescaled returns 
Estimates b* from rescaled 
Estimates b, from returns x, returns y, 
Lag Lag 
1 2 3 4 5 1 2 3 4 5 

Spot series 
Allied 2.15 1.48 1.40 1.54 1.85 1.37 1.16 1.05 1.02 1.06 
Alcoa 1.92 1.18 1.47 1.51 1.55 1.27 0.94 0.97 0.93 1.07 
Am. Can 1.36 1.22 1.28 1.24 1.23 1.17 1.03 1.08 0.97 0.93 
ATandT 1.71 1.13 1.36 1.13 0.97 1.33 0.99 1.24 1.08 0.84 
Am. Brands 1.91 1.60 1.39 1.26 1.21 1.97 1.48 1.15 1.14 1.03 
Anaconda 1.70 1.41 1.43 1.22 1.11 2.09 1.18 1.60 0.86 0.89 
Bethlehem 1.47 1.50 1.28 1.15 1.07 1.34 1.14 1.00 0.95 0.88 
Chrysler 1.68 1.30 1.33 1.47 1.25 1.23 0.97 1.00 1.14 0.88 
Dupont 1.73 1.76 1.50 1.72 1.43 1.33 1.22 1.02 1.14 1.06 
Kodak 1.96 1.74 1.96 1.65 2.55 1.11 1.08 1.13 1.05 1.10 
G. Electric 1.81 1.89 1.81 1.79 1.91 1.19 1.07 1.02 1.05 1.00 
G. Foods 1.88 1.51 1.48 1.52 1.48 1.25 1.02 1.03 1.02 1.10 
G. Motors 2.07 1.43 1.59 1.46 1.40 1.27 1.06 1.00 0.97 1.05 
G. Telephone 1.61 1.32 1.35 1.48 1.36 1.07 0.95 0.92 1.19 0.96 
Harvester 1.82 1.43 1.45 1.29 1.19 1.34 1.02 0.99 1.07 0.97 
FT30 1.338 1.49 1.32 1.32 1.43 0.98 1.11 1.04 1.04 1.05 
Gold 2.70 4.07 2.53 3.67 3.11 1.47 1.65 1.22 1.05 1.07 
Silver 3.14 4.54 4.45 3.14 4.82 1.14 1.12 0.78 1.20 1.11 
Copper 3.06 2.07 1.96 1.82 1.98 1.75 1.05 1.08 0.93 0.94 
Lead 4.09 2.60 2.59 2.05 2.03 2.72 221 1.32 0.92 1.03 
Tin 3.17 3.59 3.64 3.78 3.52 1.27 1.27 0.98 1.25 0.97 
Zinc 5.10 4.52 3.81 5.52 4.98 1.61 0.99 1.01 1.03 1.02 
£/$ 2.11 1.55 1.51 2.05 2.14 3.94 0.42 0.82 0.76 0.43 
Futures series 
Corn (12) 3.49 2.69 2.62 2.63 2.38 1.13 1.21 0.96 1.08 0.97 
Corn (6) 2.84 3.17 2.87 2.85 2.78 1.14 1.05 1.10 0.97 0.93 
Corn (3) 2.85 3.14 2.83 2.77 2.67 0.97 1.05 1.05 0.98 0.99 
Cocoa (12) 2.62 2.49 3.16 1.78 2.08 0.95 0.96 1.03 1.16 1.08 
Cocoa (6) 2.51 2.36 2.83 1.64 1.85 1.00 0.95 0.95 1.01 0.87 
Cocoa (3) 2.36 2.08 2.55 1.65 1.63 1.00 0.91 0.92 1.12 0.84 
Coffee (12) 5.77 3.33 2.57 2.01 2.57 2.38 1.25 0.92 0.94 0.86 
Coffee (6) 6.92 3.99 3.16 2.31 3.09 2.60 1.15 0.98 1.00 0.85 
Coffee (3) 12.55 6.34 3.38 2.65 2.86 2.31 1.11 1.05 1.04 0.81 
Sugar (12) 2.74 2.85 2.44 2.49 2.57 1.17 1.12 1.02 1.11 1.18 
Sugar (6) 2.61 2.62 2.39 2.33 221 1.22 1.07 1.10 1.07 1.11 
Sugar ((3) 2.50 2.58 2.27 2.21 2.10 1.27 1.09 1.07 1.03 1.17 
Wool (12) 7.43 5.99 6.45 6.82 5.93 1.21 1.45 0.87 1.07 0.97 
£/$ (6) 1.89 1.44 1.50 1.23 1.57 1.98 0.83 1.25 0.75 0.84 
DM/$ (6) 2.08 1.95 1.71 1.57 1.61 1.34 1.25 1.06 0.93 1.04 
SF/$ (6) 1.96 1.69 1.49 1.39 1.46 0.96 1.08 1.03 0.86 1.05 
T-bond 1.09 1.27 1.26 1.04 1.08 0.85 1.02 0.96 0.77 0.85 
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Summary statistics for the bš are as follows. 


Median estimate Mean estimate 
lag Lag 


1 2 3 4 5 1 2 3 4 5 


15 US stocks 1.27 1.06 1.02 1.05 1.00 1.36 1.09 1.08 1.04 0.99 
Acurrency series 1.66 0.95 1.05 0.81 0.94 2.06 0.90 1.04 0.83 0.84 
19 commodityseries 1.22 1.11 1.02 1.04 0.97 1.49 1.19 1.02 1.05 0.99 


These statistics are much closer to 1 than the respective statistics for 
returns. The autocorrelation variances are satisfactory for rescaled returns 
at lag 2 or higher, for then var (А, y) = 1/n*, but they are not entirely 
satisfactory at lag 1. 


Exceptional series 


Plots of b, against 7 show the autocorrelation variances exceed 1/n for 
returns, at several lags r, whichever series is considered. With two excep- 
tions, plots of b* against т show the variances are about 1/n for rescaled 
returns although the estimates are often scattered far above and below the 
line b* — 1. The exceptional series are the share Anaconda and the spot, 
sterling/dollar exchange rate. For example, the Б? range from 0.13 to 13.27 
for the currency series, considering lags 1 to 50. These meaningless 
estimates are caused by extreme outliers in the series ( y, }. Extreme outliers 
can dominate the estimates because they are squared in the calculations. 
Clipping outliers more than two standard deviations away from the mean 
provides satisfactory estimates. Clipping financial time series is discussed 
by Granger and Orr (1972). 


5.9 SIMULATION RESULTS 


Neither the estimates b, nor the b* are approximately constant for all т, 
suggesting the estimates are not particularly accurate. Accordingly, four 
processes {X,} have been simulated to provide information about the 
distributions of the estimates of autocorrelation variances. Two processes 
are strict white noise, the first is Gaussian and the second has the Student's 
t distribution with six degrees of freedom. The third process has a step 
change in variance after half the observations have been simulated; each 
half is Gaussian, white noise and the second half has variance equal to four 
times the variance in the first half. The other process is an example of the 
special product process described in Chapter 3; X, = V,U, with (Uj 
Gaussian white noise and (log (V,)} a Gaussian AR(1) process with auto- 
correlations 0.95" and variance 0.25. 
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Table 5.2 Summary of simulated estimates for series of 1000 returns 


Gaussian t, Variance Fluctuating 
noise noise jump variances 

For 100 estimates b, 

Mean 0.988 1.014 1.323 2.401 
Standard deviation 0.057 0.139 0.110 1.690 
For 100 estimates b 

Mean 0.906 0.864 0.915 1.187 
Standard deviation 0.067 0.106 0.073 0.152 


Reproduced from Taylor (1984) by permission of the Royal Statistical Society. 


Each process has been simulated 100 times for series lengths n equal to 
500, 1000, or 2000. Table 5.2 summarizes the simulated estimates b, and bš 
for n — 1000. 

Several conclusions are offered about the random variables B, generat- 
ing the estimates b,. Firstly, for Gaussian noise the standard deviation of B, 
is small. Secondly, high kurtosis in the distribution of X, gives a far higher 
standard deviation for B, than the comparable figure for Gaussian noise. 
Тһе & distribution has kurtosis equal to 6 and the standard deviation of B, 
is then about twice the Gaussian figure. Thirdly, a step change in variance 
increases the mean and variance of B, but not substantially. Multiplying the 
variance of normal variables by A after half the observations gives an 
asymptotic limit equal to 2(1 + A?)/(1 + A)? for all the b,. The simulations 
had A = 4and asymptotic limit equal to 1.36. The average b, was 1.32, 1.32, 
and 1.35 for n — 500, 1000, and 2000. Fourthly, the B, have very high 
standard deviations when the conditional variance of X, fluctuates. For the 
product process and series lengths considered, very approximately 
sd(B,) = 1.0, sd(B4,) = 0.6, and в4 (В) = 0.2. The distribution of B, has 
high, positive skewness and is highly leptokurtic for the product process. 
In our example the asymptotic limit of b, is B, = 2.59, using (5.5.5). The 
average value F[B,] divided by B, is approximately 0.85 for n = 500 and 0.92 
for n — 2000. 

Simulated estimates b* for rescaled returns are satisfactorily close to the 
desired value of 1 needed for standard autocorrelation tests. The estimates 
b* have much smaller means and standard deviations than the b, for the 
realistic fluctuating variance process. 


5.10 AUTOCORRELATIONS OF RESCALED PROCESSES 


Autocorrelation estimates calculated from rescaled returns are less vari- 
able than estimates calculated from returns. Rescaling may be thought to 
create spurious autocorrelation. This section uses theoretical arguments 
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to assert that such anxieties about rescaling are misfounded. Simulations 
in Section 6.9 confirm this view. 

Suppose first that U, is independent of all U,_; and V,_;,,, í > 0, as 
frequently assumed before, so (X) is uncorrelated. Also assuming X = u, 
the random variable Y, generating rescaled returns is U,(V,/V,), ELY,] = 0 
and 


cov (Y,, Yair) = ЕО, ЈЕО, V ЖҰҒА) - 0 


for + > 0, showing (Y,) is then uncorrelated. Now X is not ш but their 
difference will certainly be small for long series. As 


Y, = (X, — X)/V, = UVV) — (X — w/v, 


if this definition is applied to n variables X, and n forecasts V, to produce n 
rescaled variables Y, it can be shown that 


cov (Y,, Y,,,) = —1/п, О<т< п. 


As var (Y,) is very approximately 1, rescaling cannot cause important 
spurious autocorrelation when the returns process is uncorrelated abouta 
constant mean. 

When all processes are stationary and (X,) may be autocorrelated, the 
following inequalities were obtained as (3.7.5) for product processes: 


p. у € p, xp, u < 1. (5.10.1) 


For low т the estimates in Chapters 3 and 4 show p, y = 1 whilst, for high т, 
р.х = 0 and so p, x = p, y for all т. The autocorrelations of { Y,} cannot be 
described so easily. Series must be simulated to learn much and this will be 
done for various autocorrelated processes in Chapters 6 and 7. 


5.11 SUMMARY 


Autocorrelation coefficients calculated from returns have variances greater 
than 1/n. This is a consequence of the changes in returns variance or 
conditional variance identified in previous chapters. Assuming the 
autocorrelation variance is 1/n when the true variance is greater will make 
random walk tests unreliable. 

Estimates b, of ntimesthe autocorrelation variances should be calculated 
for several lags +. When the estimates are far more than 1 something must 
be done to improve random walk tests. Action is necessary for all the 
returns analysed here. Rescaling the returns produces series whose 
autocorrelation variances are generally satisfactory. 


Chapter 6 -- 


Testing the Random Walk Hypothesis 


The random walk hypothesis is now tested and rejected for every type of 
financial asset considered. Naturally the statistical dependence identified 
is very small and long series and powerful tests are required to prove it 
exists. Departures from random behaviour are consistent with price-trend 
models for commodities and currencies but not stocks. 


6.1 INTRODUCTION 


Several definitions of the random walk hypothesis have been offered, as 
noted in Section 1.5. The hypothesis that returns have independent and 
identical distributions (i.i.d.) was rejected comprehensively in Section 
2.10. Changes in either variance or conditional variance can suffice to 
explain the rejection of the i.i.d. hypothesis (Chapter 3). Thus the interest- 
ing hypothesis of uncorrelated returns remains untested when the i.i.d. 
hypothesis is refuted. This is true even if standard autocorrelation tests are 
applied to returns (Sections 5.6 and 5.7). 

A more general null hypothesis can be defined by firstly replacing 
identical distributions by identical means and secondly replacing indepen- 
dent distributions by uncorrelated distributions, to give 


Hy: EIX} = E[X,,] апа cov (X,, X+) = 0 
forall tandall т > 0. (6.1.1) 


This is our definition of the random walk hypothesis. It has been used 
before, for example by Granger and Morgenstern (1970). Note that H, does 
not require the process (X,) to be stationary. 

Zero autocorrelation is sufficient, at a practical level, to ensure that 
prices recorded before time tare irrelevant for forecasting prices at times 
after t. Only the latest price and the assumption of identical means are then 
needed to find optimal forecasts. Identical means are included in H, to 
ensure the expected value of a sample autocorrelation coefficientis almost 
zero when the X, are uncorrelated. Means defined by financial theories 
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need not be constant. Joint tests of such theories and zero autocorrelation 
are given in Section 6.11 after the results of random walk tests. 

Rejection of the random walk hypothesis is not sufficient to refute the 
efficient market hypothesis. Trading costs can prevent the exploitation of 
statistical dependence and then the random walk hypothesis is false but 
the efficient market hypothesis is not. It is easier to test for randomness 
than for efficiency so it is best to test the random walk hypothesis first. The 
other hypothesis is investigated in Chapter 8. 

Many ways to test the random walk hypothesis have been proposed. 
Following methodological remarks in Section 6.2 and a review of the 
distributions of sample autocorrelations in Section 6.3, a variety oftests are 
described in Section 6.4. Powerful tests are based upon an understanding 
of plausible alternatives to random behaviour. Consequently, autocorrela- 
tions and models consistent with price-trends are discussed in Section 6.5 
and used to motivate appropriate test statistics in Section 6.6. The effects of 
errors in studied prices are noted in Section 6.7. After these preliminaries 
the results of random walk tests are given in Sections 6.8 and 6.9 and test 
power is discussed further in Section 6.10. 

It is concluded from the tests that returns are slightly autocorrelated. 
Sections 6.11 and 6.12 show the dependence cannot be explained by 
changes in interest rates or risk premia, by calendar effects or by market 
regulations. 


6.2 TEST METHODOLOGY 


Reliable statistical tests reject a true null hypothesis for a proportion of 
datasets equal to or less than the significance level. In other words, the 
significance level is the maximum probability of a Type | error. Correct 
distributions must be used for sample autocorrelations to give reliable 
tests of the random walk hypothesis. Otherwise, the significance level will 
be misleading. In particular the significance level is understated when 
autocorrelation variances are higher than the results given by large sample 
theory for a strict white noise process. It was shown in Chapter 5 that this 
сап bea very serious problem. It can be avoided by using rescaled returns. 

The choice of significance level is always arbitrary to some degree. A5 
per cent level is used throughout the chapter. 

Powerful tests reject a false null hypothesis as frequently as is possible. 
Such tests minimize the chance of a Type ПІ error for a given significance 
level. Many popular test procedures are not powerful. For example, the 
common practice of applying two-tail tests to a number of autocorrelation 
coefficients, one by one, must give confusing conclusions when some 
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coefficients are significant but several are not. What is needed are specific 
plausible alternatives to the random walk hypothesis and tests having high 
power when these alternatives are better models than the random walk. A 
practical alternative is trends in prices. Models are given in Section 6.5 and 
tests are derived in Section 6.6. 

The rejection region for a test can be either one or both tails of a 
particular distribution. Deciding to use one tail rather than both can 
increase test power. 

Several random walk test statistics are available so it is tempting to 
perform several tests. This is understandable yet causes difficulties when 
the various tests yield different conclusions, as often happens. In principle 
the most likely alternative hypothesis should be identified, then a powerful 
test selected and the test conclusion accepted. However, few researchers 
would be disinterested in the results from other tests. A compromise 
methodology is followed here. A favoured alternative hypothesis and test 
are studied, taking other statistics seriously only if their test results are 
particularly clearcut. 


6.3 DISTRIBUTIONS OF SAMPLE AUTOCORRELATIONS 


The sample autocorrelation at lag 7, considered as a random variable based 
on n random variables X,, is 


к, = Y (X, — ХХ, — 2/3 (X, Ху, X = Y Хп. 
t=1 t=1 t=1 


Replacing X, by the observed return x, and X by x gives the observed 
autocorrelation r,. A second subscript is not needed for К, and r, in this 
section; all results are for a process {X;}. 

Information about the distribution of R, for true and false null hypotheses 
is needed to obtain powerful random walk tests. We have already seen 
asymptotic results, as n — œ, for strict white noise processes (Section 
1.10). These are now extended to include autocorrelated linear processes. 
It is assumed the X, have finite variance but not necessarily a normal 
distribution. 

Very low autocorrelation occurs when the random walk hypothesis is 
false thus test statistics may only just be significant. Relying on asymptotic 
results can then require justification, so finite sample results are also 
summarized. 

Theoretical conclusions generally assume a linear generating process. 
Conclusions are given whilst acknowledging that the returns process is not 
linear. Nevertheless insights from linear conclusions are helpful for tests 
on areturns process rescaled to make it approximately linear. 


136 Modelling financial time series 


Asymptotic limits 


The asymptotic distribution of VnR, is N(0, 1) for independent and identi- 
cally distributed (i.i.d.) Х, for all 7 > 0. Also К, and K; are asymptotically 
independent for т # £. 

Many tests rely solely on these results. To derive new tests in Section 6.6, 
results are also needed for autocorrelated processes. For linear {X}, 
defined in Section 1.10, the asymptotic expected value of R, is the popula- 
tion autocorrelation р, and the asymptotic variance of VnR, equals 


Wrz = Ag + А, + 209p? — 2A, p,) (6.3.1) 


with 


As n — v, Vn(R, — p,) > N(0, œw). These equations show the variance w, 
depends on all the autocorrelations, not simply p,. The asymptotic 
covariance between V/nR, and V/nR; equals 


= = Agr + Ness + 2(Àop; De = A; Pg - À¿p;). (6.3.3) 
Also the asymptotic distribution of the vector 
Vn(R, — pi, R, — р, ..., Re — ру) 


is multivariate normal for any k, with means, variances, and covariances as 
above. All the asymptotic results are proved by Anderson and Walker 
(1964). 


Finite samples 
Sample coefficients К, always have a bias of order 1/n. From Moran (1967), 
E[R,] = —(n — т)/{п(п — 1j (6.3.4) 


fori.i.d. variables. The bias for a general linear process is given by Lomnicki 
and Zaremba (1957). It can be non-trivial and will be mentioned again in 
Section 6.7. 

Next consider variances and covariances for the K, calculated from i.i.d. 
variables. Application of a method used by Moran (1967, pp. 396-7) proves 
the conclusion 


var (R,) < 1/n for + > 0 and n > r. (6.3.5) 
The alternative definition of sample autocorrelation, 


R: = nR,/(n — т), 


Testing the random walk hypothesis 137 


has the disadvantage that its variance can be slightly more than 1/n (Taylor, 
1982b, p. 42). More algebra shows the covariance between coefficients at 
different lags is of order 1/n? with 


cov(&, R) «0 forn > 2€> 2: > 0. (6.3.6) 


These results can increase our confidence in the reliability of asymptotic 
results. For example, suppose the random walk test statistic is K, with 
rejection of the null hypothesis if r, > 1.645/V/n, a one-tail test. Then (6.3.4) 
and (6.3.5) show the actual significance level is marginally less than the 
nominal 5 per cent level when a normal distribution is suitable for R,. More 
exact results require computer simulations and examples are presented 
later in this chapter. 


6.4 A SELECTION OF TEST STATISTICS 


Autocorrelation, spectral and runs statistics have been used by scores of 
researchers to investigate the random walk hypothesis. Some articles 
report only one type of statistic, others all three. Some researchers have 
simply given the statistics, others have used them for tests. There is no 
consensus in financial literature about an appropriate set of test statistics. 
Results will be given for eleven test statistics. Fight of them are now 
described. They are identical or similar to most published tests. Three 
further tests are defined later, motivated by the idea of price-trends. It is 
assumed that autocorrelations r, are computed from n observations by a 
method ensuring that standard asymptotic results (Section 6.3) are applica- 
ble when the random walk hypothesis, denoted My, is true. 


Autocorrelation tests 


A popular and simple test uses the first autocorrelation coefficient, reject- 
ing Hy at the 5 per cent significance level if үл |, | > 1.96. This test applies 
the result VnR; ~ N(0, 1), approximately, when H, is true. Тһе test is 
logical and powerful if any dependence between returns is expected to be 
confined to consecutive returns. 

When many coefficients r, are considered some will probably be signifi- 
cant even if H is true; on average 1 out of 20 would then be significant at 
the 5 per cent level. Finding a few significant coefficients would not tell us 
much, unless their lags could be explained theoretically or some overall 
test statistic is used. The binomial distribution gives the probability of N or 
more out of К coefficients being significant at, say, the 5 per cent level when 
Н, is true. Then Нұ can be rejected if this probability is less than 5 per cent. 
To obtain a useful test requires numbers k and М, for which 
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k! 
am К D! 


One suitable choice 15 k = 28 and № = 4, then а = 0.049. Define М, by 
counting the number of coefficients г, having |r,| > 1.96/V/n, 1 < т < 28. 
Then H, is rejected at the 5 per cent level if N, > 4. 

^ more natural way to combine k coefficients into a single test statistic is 
given by 


(0.05) (0.95) <” = 0.05. 


k 
Q, = n У r. (6.4.1) 

т=1 
When A, is true, п(Кї + ++- + RẸ) ~ xz, applying the assumed indepen- 
dence of R, and Ку, т Z <. The null hypothesis is rejected for sufficiently 
high values of Q,. Results will be given for three tests, using k = 10,30, and 
50. The respective tail areas in which H, is rejected are Q4, > 18.31, 
Оз > 43.77, and Qs, > 67.50. The Q statistic is frequently used in general 
time series research to test for zero autocorrelation. Note that both N, and 
Q; are not designed to be powerful for a specific alternative to the random 

walk hypothesis. 


Spectral tests 


Spectral analysis is an alternative to studying autocorrelations. Itis particu- 
larly appropriate when cycles in returns are the preferred alternative to 
random behaviour. A typical cyclical model is 


J 
X, = У оу cos (wt В) + & (6.4.2) 
із 


the о, Bj, апа w being constants and the ғ, uncorrelated. Cycle j then has 
frequency w; and repeats itself every 27/w, time units. The evidence for 
cycles is not impressive and consequently the discussion of spectral 
methods will be brief. Granger and Newbold (1977, Ch. 2) describe spectral 
theory relevant for economic studies and Praetz (1979a) highlights practical 
problems encountered when testing returns for a flat spectral density. 
Autocorrelation analysis is much easier than spectral analysis. 

The spectral density function for a stationary process сап be defined by 


slw) = eias + 2 Y p, cos ew) | 0 < o < 2T (6.4.3) 


7-1 


with c? = var (X). The integral of s(w) from 0 to 27 equals о? and 


s(w) = 5(2т — в) so itis only necessary to consider the frequency range 0 
to т. There will be peaks in a plot of s(w) at the frequencies аҙ for the cyclical 
model (6.4.2). If, however, the random walk hypothesis is true s(w) will be 
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constant for all o. To test H, we need to estimate в(о) and then test for a 
constant spectral density. 
Meaningful estimates of Қо) = 275(»)/a? have the general form 
M-1 
Қо) = 1 + 2 У A,r, cos (ro) (6.4.4) 

r=1 

with positive and monotonically decreasing À, ensuring consistent esti- 

mates. The Parzen weights are used here, defined for fixed M by 


à, = 1 670(М – TYM? 0б<т<=мМ/2 


6.4.5 
= (М — 7)3/M3 MR < т< M. ( ) 


Equation (6.4.4) shows spectral estimates are linear functions of the first 
M — 1autocorrelations. Plots of these r, and the estimated spectral density 
Қо), 0 < о < т, convey the same information. The spectral plot will be 
more helpful when cycles exist. 

Spectral estimates have been calculated using M — 100 for spot series 
and futures series taking a year of returns from each contract. Futures 
series using less returns per contract must have smaller M, here 40 for six 
months data per contract and 20 for three months per contract; these 
series are not very suitable for estimating spectral densities. 

Praetz (1979a) shows that Қаң) and f(w;) are correlated estimates only if 
[оу — œ| < Зт/ М. Consequently tests are here based on Ёо) evaluated for 
w = 0, Ат/М, ӛт/М,..., т, giving 1 + M/4 potential test statistics. These 
statistics can be standardized using the asymptotic theory for sample 
autocorrelations to give 


M-1 

f, = ІКА/л/М) — пуа У [A, cos isjrmiaay Pin} (6.4.6) 
т=1 

forj = 0,1, ..., М. The f are effectively independent observations from 

N(0, 1) for large sample sizes n when Ho is true. 

The most plausible cycle period is one week, when the frequency is 
w = 27/5 and the standardized spectral statistic is f, j = M/10. This test 
statistic will be denoted f, to emphasize the period tested. A one-tail test is 
appropriate. 

Some authors, see Praetz (1979a), have counted the significant peaks and 
troughs in plots of the estimated spectral density. Let N, count the number 
of times И exceeds 1.96 for 0 < j < M/4. Like the number of significant 
autocorrelations N,, the test statistic М, has a binomial distribution when H, 
is true. We reject Hy if N, is greater than or equal to a number fixed by М; 
reject at the 5 per cent level if N, = 4 (M = 100) or N, > 3 (M = 40) or 
N, 2 2 (M = 20) the exact significance levels then being 3.9, 1.5, and 3.3 
per cent. Spectral tests will be given for f,, N,, and a third statistic 
motivated later by the idea of price-trends. 
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The runs test 


As returns have a non-normal and perhaps non-stationary distribution, 
non-parametric tests could be appropriate. The only non-parametric statis- 
tic used to date is the total number of runs. A positive run is a sequence of 
consecutive positive returns, a no-change run is a sequence of zero 
returns, and a negative run has a similar definition. Let x? be 1,0, or —1 for 
positive, zero, or negative x, respectively. Also let h, be 0 if x? = хі, and 
let it be 1 otherwise. Then h, = 1 signifies that x,,, begins a new run and so 
the total number of runs of all types is 


п-1 
H=1+ YR. (6.4.7) 
t=1 


Suppose there are n, positive returns, m zero returns, and л; negative 
returns in aseries. Then the mean and variance of the variable H generating 
Н, conditional upon n,, m, and лз, аге 


E[H] = n + 1 (У niin) 
and 

var (H) — {> (Ж п? + n + п?) — 2п У п} — т ya — n) 
when a certain null hypothesis Hü is true, always summing over j = 1,2,3 
(Mood, 1940). This Hš is the hypothesis that the x7 are generated by a strict 
white noise process (X1). It is usually assumed that there is no practical 


difference between Но and Hš. For large n, H is approximately normal so 
tests can use 


K = (H — E[H))/Vvar (Н) (6.4.8) 


rejecting Hš (and Ho) at the 5 per cent level if |K| > 1.96. Trends would 
cause fewer runs than expected (so K < 0), price reversals would produce 
more runs (K > 0). One-tail tests can be used if either possibility is 
considered likely. 

The runs test is easy to perform and avoids all problems due to variance 
changes. Its great disadvantage is low test power caused by losing informa- 
tion in the transformation from x, to x7. To see this, assume xf is always 1 
or —1, i.e. suppose there are no zero returns. Then the total runs H are 


1 п-1 5 1 n-1 
H=1+- У xt—-xta)y-z5(n*1- xix (6.4.9) 
4 (£1 2 t-1 


As the average of the xf will be approximately zero, their lag one auto- 
correlation is approximately 


n-1 

ж.ж 
> хх in 
еп 
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because x*? is always 1. Thus H is essentially a linear function of a lag one 
autocorrelation. 

However, the first lag autocorrelations of (X,) and {Xt}, denoted p, and 
рї, generally have | p1| < |р,1. It is therefore harder to detect dependence 
using H (ог K) than it is using л. For example a zero-mean, stationary, 
Gaussian process (X,) has 


pt = EIX X? a] = PIXY = X4) — Р(Х} # XE) 
= 2Р(Х* = Xt44) — 1 = (2/m) arcsin (pj) (6.4.10) 
= 0.64p,. (6.4.11) 


Equation (6.4.10) is quoted by Granger and Newbold (1977, p. 287), whilst 
(6.4.11) follows from the fact that p, must be small. In conclusion, the runs 
test is essentially a special first lag test having less power than r, because it 
uses less information. 

A further problem arises if thin trading causes several no-change runs. 
These may be responsible for tess total runs than expected thereby refuting 
independence (апа Hš) but not the random walk hypothesis (Ho). 


6.5 THE PRICE-TREND HYPOTHESIS 


The idea of trends in prices is a specific alternative to the random walk 
hypothesis. Trends are a popular alternative to randomness, particularly in 
market literature. 

A price-trend is essentially a general movement of prices in a fixed 
direction, up or down. Published evidence for trends is sparse compared 
with publications favouring random behaviour. This may, however, be the 
consequence of using inappropriate random walk tests. Stevenson and 
Bear (1970) and Leuthold (1972) offer some evidence for trends although 
Praetz (1976b) has criticized their methods. 

Trends would imply that prices do not adjust fully and instantaneously 
when new information becomes available. Instead, some new information 
would have to be incorporated slowly into prices. Trends will occur if 
information is used imperfectly, for example if enough people are irra- 
tional or rational but unable to interpret all information quickly and 
correctly. 

The trend idea is covered thoroughly in this book. Autocorrelations 
consistent with trends are now defined and appropriate random walk tests 
are deduced in Section 6.6, then models and forecasts are described in 
Chapter 7 followed by the profitable results of trading rules in Chapter 8. 


Price-trend autocorrelations 


Slow interpretation of a particular information item will cause several 
returns to be partially determined by the same information. The fundamen- 
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tal trend idea is that these several returns are all influenced in the same 
way, either towards a positive conditional mean or towards a negative 
conditional mean. Thus trends will cause positive autocorrelations. The 
impact of that current information which is not fully reflected in the current 
price, upon future returns, should diminish as time goes on. Thus the 
autocorrelations should decrease as the lag increases. We know that all 
autocorrelations must be small. Therefore trends imply: 


р, > 0, p, > рт and 125, all 7 > 0, (6.5.1) 


L.e.1 > p, > p; > р>... > 0. 

To progress to a testable alternative to the random walk hypothesis 
requires parametric autocorrelation functions consistent with the observa- 
tions summarized by (6.5.1). The simplest such functions define the 
price-trend hypothesis, first investigated in Taylor (1980): 


Hip = Ap’, А, р, r> 0. (6.5.2) 


The autocorrelations p, may refer to either the returns X, or the standard- 
ized returns U, = (X, — шу/У,. We use the notation H, to emphasize an 
alternative hypothesis to the random walk hypothesis H;. Once more we 
are considering the autocorrelations of ARMA(1, 1) processes. 

There are two parameters in H,. Parameter A measures the proportion of 
information not reflected by prices within one day. Parameter p measures 
the speed at which imperfectly reflected information is incorporated into 
prices. As A — Oor p — 0, information is used perfectly. When price-trend 
models are credible we will later discover that typical parameter values are 
A = 0.03 and p = 0.95. Low values for A are inevitable whilst values for p 
near to 1 indicate trends lasting for a long time. 


An example 


Many statistical models are consistent with the price-trend hypothesis and 
detailed examples will be presented in Section 7.2. One simple example 
assumes the return X, is the sum of an autoregressive trend component и, 
and an unpredictable residual e: 


X = Be te, (6.5.3) 
шщ m р(ш-1 = ш) + т, 
Ele] = Е[%] = Ее,е,..] = Етті.) = Elem) = 0 


for all s, tand 7 > 0. As 
СОУ (X,, X...) = COV (us, ш) = p var (Qu) 
for all т > 0, it follows that H; is true with A = var (uj)/var (X,). 


Testing the random walk hypothesis 143 


Price-trend spectral density 


From (6.4.3) and (6.5.2), the spectral density function for the price-trend 
hypothesis is a constant times 


f(w) = 1 + 2A > Р” сов (ro) (6.5.4) 


т-1 
= 1— A+A — p’)/{1 — 2р cos (о) + p°}. 


The function f(@) decreases monotonically from ҚО) = 1 + 2Ap/(1 — р) to 
Қт) = 1 — 2Ap/(1 + p). There will be a single thin peak atw = Oif p = 1. 


6.6 TESTS FOR RANDOM WALKS VERSUS PRICE-TRENDS 


Powerful tests of the random walk hypothesis H, against the price-trend 
hypothesis H, сап be constructed using the theoretical distributions of 
sample autocorrelations. Comprehensive details are given in Taylor 
(1982b). We will now derive test statistics 7* and f, given by equations 
(6.6.2) and (6.6.3) respectively. 

Suppose firstly that k sample autocorrelations are to be used inatestand 
they аге the realized values of the vector R = (Ri, Ro, ..., Қ). Assume 
secondly that the asymptotic results of Section 6.3 are applicable to R, with 
E[R] = p. Then Vn(R — p) has a multivariate normal distribution, N(0, Q), 
for some k x k matrix Q. When Ну is true, р = (0, 0, ..., 0) and Q; is the 
k x k identity matrix /,. When H, is true, р = (Ap, Ap’, ..., Ар9 and 
О, = I, if Ais small; for typical parameters (А = 0.036, p = 0.97) the entries 
w, in the matrix Q, have 1.15 > w,, > 1 and 0.22 > w, > 0 (т # ë), see 
Taylor (1982b, рр. 42-3). To motivate practical test statistics it is assumed 
thirdly that the matrix Q, can be replaced by I, even if H is true. 

The alternative hypothesis H, has two unspecified parameters. Now 
consider the best test if these parameters are fixed to give the restricted 
alternative Hï : p, = А*ф". The optimal test of Н, against H1 using observed 
autocorrelations r = (л, ry, ..., л) must use the likelihood-ratio statistic 
whose logarithm is 


1 = log {L(r| H¥)/L(r| H9) 
with L(r|.) denoting the likelihood of the vector r. Applying all the 
assumptions previously stated, 
k 
[= nA* У фт, + 1, 
т=1 


with h a constant independent of r. Thus Hy should be rejected if 
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k 
ke = > OG (6.6.1) 
т-1 


exceeds a number determined by the significance level and the number of 
returns. 

The test parameter ф should reflect prior opinions about the trend 
parameter p. Of course k and $ must be chosen before calculating the r,. 
These choices must be largely subjective, because A and p will be unknown 
if H, is true. One way to choose k and ф is to seek values giving high test 
power whenever A, p, and n fall inside a set 5 of plausible trend parameter 
values and series lengths. 

In 1979, the set S defined by 0.01 < A x 0.04, 0.8 < p < 0.975, and 
250 < n < 2000 was considered appropriate after studying some of the 
copper and sugar returns but none of the other returns tested here. For this 
S the best К is 30 and the best $ is 0.92. These test parameters have power 
close to the best obtainable using the k and $ optimal for any specific triple 
(A, p, n) in 5, see Taylor (1982b, pp. 44-5). (The power calculations used a 
5 per cent significance level and the correct matrix Q,; this matrix was not 
assumed to be /, when H, is true.) The set S is still considered appropriate 
and therefore it is still recommended that tests for trends use k = 30 and 
ф = 0.92. 

When H, is true and standard asymptotic results are assumed, > % R, 
has mean zero and variance > ¢*7/n. Consequently, as n — œ, the test 


statistic 
30 12 
Ton (F 0.9227n] 
т=1 


30 
0.4274V/n У) 0.927, (6.6.2) 
т=1 


T* 


is an observation from N(0, 1) if H, is true. We accept Hj if T* < 1.65 and 
accept H, if T* > 1.65, using a5 per cent significance level. 

The recommended statistic 7* has high power for testing Ho against H, 
because it uses several coefficients r, and exploits the price-trend predic- 
tion of positive autocorrelation, decreasing as the lag increases. Any 
similar linear function of several autocorrelations, say илл + wor, +--+ + 
wr, with monotonically decreasing positive weights w,, is likely to provide 
a fairly powerful test statistic. Choosing the w, proportional to 0.92” may 
appear arbitrary. The Parzen weights А, used to define spectral estimates 
(see (6.4.5)) provide an alternative set of weights w,. The standardized 
density statistic at zero frequency is the following linear function of M — 1 
sample autocorrelations: 


M-—1 M-1 1⁄2 
һ= > IP xn) : (6.6.3) 
т=1 т=1 
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This is the only spectral statistic likely to have high power when the 
alternative hypothesis is price-trends. We reject H, at the 5 per cent level if 
fh > 1.65. The parameter value M has been selected as in Section 6.4. 


6.7 CONSEQUENCES OF DATA ERRORS 


Price series often contain errors. Checking large returns against another 
source will find any large errors which can then be corrected. Small errors 
may remain and sometimes it is impossible to check a series because of 
insufficient information about dates. The primary consequence of errors is 
to decrease ғ), because an error in the price z, causes errors in the two 
returns x, and ху one being positive and the other negative. 

A model for price errors is 


log (Z) = log (Zt) + 6, 


with Z, 2%, and ó, generating the analysed price, the true price, and an 
error respectively with a very high chance that ô = 0. Assuming {6,} is 
white noise, stochastically independent of true returns (Xf), with 
y = var (8,)/var (X1), it can be shown that the theoretical autocorrelations 
p. of the analysed returns are related to those of the true returns р* by 


ру = (рї = ф)/(1 + 2ф), p, = pt + 24) if 7 > 2. 


Estimating wis difficult but it is clear that the major impact of errors occurs 
at the first lag. 

The T* test is particularly vulnerable to data errors. An obvious remedy 
when errors are suspected is to ignore г, and then T* should be replaced Бу 


30 
* = 0.4649 Vn У 0.927r,. (6.7.1) 
т=2 
Ав with 7%, the asymptotic distribution of U* is №0, 1) when the random 
walk hypothesis is true. Simulations have shown that the test power lost by 
using U* instead of T*, when there are no data errors, is very low if H, is 
true and pis nearly 1. It therefore seemed prudent in previous research to 
insure against data errors by using U* for empirical tests. Data errors may 
be rarer than once feared, according to the test results soon to be 
presented. 
The simulations made it clear that asymptotic theoretical results con- 
sistently overestimate the power of T*. This is due, at least partially, to a 
downward bias in the sample coefficients. When H, is true, 


EIR] — p, = —[1  zAp* + 2Ap/(1 — p)l/n, (6.7.2) 


an approximation deduced from Lomnicki and Zaremba (1957, p. 156). For 
example if A = 0.02, p = 0.95, and n = 1000, then the bias F[R,] — p, is 
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about —0.002 and the correct mean of the variable generating 7% is about 
0.27 less than the 1.84 predicted by asymptotic theory. 

Let m = 1/(1 — p) measure the duration of trends, explained properly in 
Section 7.2. Figure 6.1, reprinted from Taylor (1980), shows the regions for 
trend parameters A and m over which T* and U* are estimated to have 
power greater than 0.5, when the significance level is 0.05, the alternative 
to random behaviour is price-trends and there are 500, 1000, or 2000 
returns. 

On a given curve, there is estimated to be a 50 per cent probability of 
falsely accepting the random walk hypothesis when there are in fact trends 
in the stochastic process; north-east of the curves this probability is less 
than 50 per cent. These curves are based on simulations, not asymptotic 
theory. Figure 6.1 illustrates the advantages of long series compared with 
short series. It is certainly desirable for series to span at least eight years 
providing at least 2000 daily returns. 


0.06 


0.04 


0.02 


Figure 6.1 The power of the test statistic U* is approximately 0.5 on the curves 
when the significance level is 0.05. Reproduced from Taylor (1980) by permission of 
the Royal Statistical Society 


6.8 RESULTS OF RANDOM WALK TESTS 


The eleven test statistics considered are T* (equation (6.6.2)), U* (6.7.1) 
and f, (6.6.3) intended to detect any evidence for trends, N, and N, which 
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respectively count the number of significant autocorrelations and spectral 
estimates, r, Vn which relies on any important autocorrelation to occur at a 
single day's lag, Q, (k = 10, 30, 50) summarizing several autocorrelations 
(6.4.1), f, the test fora weekly cycle, and Kthe non-parametric runs statistic 
(6.4.8). All the autocorrelation and spectral statistics are functions of 
autocorrelation coefficients r,. These coefficients have been calculated 
from rescaled returns y,, for the reasons given in Chapter 5, particularly in 
5.7 and 5.8. The coefficients are denoted г, , or sometimes simply г,. 
Readers are reminded that 


y, = (x, — XY, (6.8.1) 

9 = (1— ya *1253y|x 4 — Х| (6.8.2) 
and 

һу= > y- УХУ — y) / > (y - y». (6.8.3) 


t=21 t=21 


See Section 5.8 for complete details about these equations including the 
selection of initial V,. In (6.8.2), y is 0.04 for stock series and 0.1 for the 
others. As with other autocorrelation estimates, the numerator summation 
in (6.8.3) for futures series is over those t for which x, and x,,, are returns 
from the same contract. The number of terms іп the denominator of г, is 
n — 20 and denoted by n*. Whenever necessary n* replaces n in the 
equations defining test statistics. 

Table 6.1 presents г, у, the number of г, , over lags 1 to 30 in each of the 
six classes introduced in Section 2.9 and, for comparison, the first-lag 
coefficient r, , calculated from returns 21 to n inclusive. 

Table 6.2 gives the values of the 11 test statistics for all 40 series. Values 
significant at the 5 per cent level appear in italic. It can be seen that a large 
number of the statistics are significant. Before calculating the test values, 
my preferred test statistic was U*. This statistic is not significant for any of 
the stock series. Indeed the highest stock value is 0.62, far below the 
critical value 1.65. However, U* rejects the random walk hypothesis for 
every commodity and currency series with the single exception of the wool 
series. The other trend statistics, 7% and f, each reject the random walk 
hypothesis for every commodity and currency series. Thus there is evi- 
dence for trends in commodity and currency prices but notin stock prices. 

Looking at the other test statistics for stocks it is seen that АҒАДЫ і5 
significant for all the series, often at avery low significance level. Therefore 
all the long price series are not random walks: their daily returns are notthe 
output of an uncorrelated process having constant mean. The short 
Treasury Bond series does not reject the random walk hypothesis but it has 
insufficient observations to give decisive results. 
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Table 6.1 Autocorrelations r, , for rescaled returns 


Іар 1 1арѕ 1—30, frequency Бу class 

Ny ns 1 2 3 4 5 6 
Spotseries 
Allied 0.072 0.087 0 0 15 14 1 0 
Alcoa 0.187 0.150 0 0 19 10 0 1 
Am.Can 0.133 0.143 0 1 17 11 0 1 
ATandT 0.098 0.088 0 0 21 8 1 0 
Am.Brands 0.095 0.093 0 1 17 11 1 0 
Anaconda 0.047 0.056 0 0 16 14 0 0 
Bethlehem 0.106 0.107 0 0 20 9 0 1 
Chrysler 0.045 0.044 0 1 12 17 0 0 
Dupont 0.131 0.114 0 0 14 15 0 1 
Kodak 0.054 0.035 0 0 16 13 1 0 
G. Electric 0.101 0.099 0 0 14 15 0 1 
G.Food 0.113 0.089 0 0 16 13 0 1 
G.Motors 0.055 0.049 0 0 17 12 1 0 
G.Telephone 0.065 0.051 0 0 19 10 1 0 
Harvester 0.079 0.085 0 0 16 13 1 0 
FT30 0.051 0.039 0 1 13 15 1 0 
Gold —0.012  —0.053 0 0 8 19 3 0 
Silver —0.066 | —0.080 0 1 11 16 1 1 
Copper —0.017  —0.056 0 0 10 19 1 0 
Lead 0.021 -0.020 0 0 6 24 0 0 
Tin 0.030 0.021 0 0 11 19 0 0 
Zinc —0.009  —0.074 0 0 8 22 0 0 
£/$ spot 0.097 0.038 0 0 3 23 4 0 
Futures series 
Corn (12) —0.016 -0.027 0 0 10 18 2 0 
Corn (6) —0.010 0.005 0 0 10 19 1 0 
Corn (3) 0.001 0.013 0 0 9 20 1 0 
Cocoa (12) 0.045 0.015 0 0 8 17 5 0 
Cocoa (6) 0.071 0.034 0 0 7 18 5 0 
Cocoa (3) 0.091 0.070 0 0 8 20 2 0 
Coffee (12) 0.104 0.026 0 0 8 17 4 1 
Coffee (6) 0.131 0.062 0 0 9 18 2 1 
Coffee (3) 0.131 0.063 0 0 8 19 2 1 
Sugar (12) 0.063 0.044 0 0 2 26 2 0 
Sugar (6) 0.084 0.064 0 0 1 27 2 0 
Sugar (3) 0.085 0.064 0 0 3 24 3 0 
Wool (12) 0.040 0.075 0 0 12 18 0 0 
£/$ (6) 0.073 0.030 0 0 5 23 2 0 
ОМ (6) 0.023 0.007 0 0 10 17 3 0 
SF/$ (6) 0.005 0.048 0 0 8 20 2 0 
T-bond —0.051  —0.088 0 3 14 10 3 0 


Note: Autocorrelations for rescaled returns are allocated to these six classes: (1) r « —0.1, (2) 
—0.1 < r < —0.05, (3) —0.05 < r < 0,(4)0 < r < 0.05, (5) 0.05 < r x 0.1, (6) r > 0.1. 
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Table6.2 Values of the random walk test statistics 


T* U* vVn*rn М, Qw Ою Qs № fo f, K 

Spot series 
Allied 0.73 -0.82 3.77 4 23.55 54.20 70.50 6 —0.16 —1.09 —3.11 
Alcoa 2.87 —1.06 9.77 3 102.91 124.40 145.30 9 0.25 0.51 —7.71 
Am. Can 1.84 —0.97 6.94 2 58.30 85.40 99.53 6 0.31 7.92 —4.79 
ATandT 0.19 —1.98 5.114 40.89 64.56 77.214 -0.71 2.97 —4.00 
Am.Brands —0.74—2.92 4.94 3 44.65 63.24 76.96 7 -1.08 3.07 —0.52 
Anaconda —0.31 —1.37 2.431 14.81 23.07 34.78 0 —0.11 0.96 —4.80 
Bethlehem —0.26 —2.65 5.542 45.63 61.40 88.49 4 —1.32 —0.01 –2.72 
Chrysler —0.03 -1.05 2.37 5 26.78 51.05 71.43 2 0.88 —0.16 —1.56 
Dupont 3.22 0.58 6.822 52.04 66.15 75.90 6 1.84 0.27 —3.82 
Kodak —0.42 —1.68 2.84 1 19.91 37.37 54.56 3 0.11 0.22 —3.79 
G. Electric 1.24 —0.92 5.302 38.98 59.61 70.54 4 0.25 7.75 —3.65 
G. Food 0.79 —1.67 5.922 46.13 67.32 87.00 4 —1.09 0.64 —4.55 
G. Motors —0.23 —1.47 2.86 5 25.89 51.00 60.012 -0.38 4.51 —3.99 
G.Telephone —0.28 —1.76 3.402 20.04 40.36 49.00 2 -1.55 1.71 —3.78 
Harvester 0.89 —0.81 4.75 2 24.67 42.17 68.94 2 —0.29 —0.45 —1.56 
FT30 1.41 0.62 2.132 20.31 39.93 49.04 2 0.16 0.88 —1.85 
Gold 4.73 5.36 -0.5 5 29.31 63.26 88.20 3 4.67 —0.83 4.45 
Silver 1.94 3.04 —2.16 З 34.03 49.30 68.64 6 2.54 0.19 3.57 
Copper 4.03 4.84 —1.08 5 37.85 53.85 71.34 4 3.99 —1.11 1.91 
Lead 3.82 3.66 1153 10.33 36.75 66.84 1 5.43 —0.33 1.52 
Tin 2.24 1.74 1.631 15.96 2929 48.47 3 2.83 0.98 —0.56 
Zinc 2.12 2.52 —0.51 1 8.41 23.11 50.22 2 3.46 —0.35 —0.57 
£/$ 6.56 5.29 4.306 35.28 80.60 125.69 3 9.05 1.19 —2.28 
Futures series 
Corn (12) 1.89 2.46 —0.93 2 15.51 36.34 61.09 1 2.10 1.16 1.89 
Corn (6) 3.03 3.54 —0.57 3 22.04 38.64 51.78 3 3.31 2.37 1.75 
Corn (3) 3.22 3.56 0.04 3 27.15 36.07 42.74 3 2.75 2.72 1.79 
Сосоа (12) 6.65 6.24 2.34 9 41.57 83.18 92.76 3 6.39 0.89 —0.85 
Сосоа (6) 6.34 5.40 3.51 6 36.25 72.99 85.16 1 6.42 0.81 —0.60 
Сосоа (3) 6.43 5.08 4463 46.43 65.33 72.56 3 5.99 0.85 —1.34 
Coffee (12) 6.96 5.25 5.42 6 66.60 84.14 100.54 З 5.05 —0.30 —2.91 
Coffee (6) 6.67 4.46 6.53 5 26.36 88.02 101.32 5 6.42 0.92 —2.97 
Coffee (3) 6.20 3.96 6.51 4 71.27 83.92 91.85 5 6.32 2.47 —1.89 
Sugar (12) 7.67 6.61 4.56 7 73.17 105.59 122.70 3 7.26 1.69 —3.42 
Sugar (6) 8.97 7.29 5.76 7 85.01 109.70 132.63 2 8.66 1.28 —4.59 
Sugar (3) 9.26 7.57 5.86 8 92.94 114.51 121.00 3 8.71 1.76 —4.41 
Wool (12) 2.26 1.49 2.274 19.96 42.31 64.64 3 2.05 —0.50 —3.99 
£/$ (6) 3.53 2.46 3.232 16.67 32.70 41.542 3.38 2.03 —1.08 
DM/$ (6) 4.3 4.82 1.013 29.76 56.79 64.08 1 4.88 1.40 1.13 
SF/$ (6) 3.01 3.19 0.213 13.24 42.17 53.28 1 3.01 0.63 0.81 
T-bond —0.02 0.46 —1.14 0 11.39 23.72 38.44 1 —0.29 0.15 2.30 
Critical 

point(s) 1.65 1.65 +1.96 4 18.31 43.77 67.50 4? 1.65 1.65 +1.96 
Notes 


24 for spots and futures (12), 3 for futures (6), 2 for futures (3). 
Values significant at the 5 per cent level appear in italic 
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Table6.3 Frequency of random walk rejections by test statistics 


16 stock series 15 commodity and currency series 
Statistic Frequency Statistic Frequency 
n 16 T* 15 
Ою 15 fo 15 
K 12 U* 14 
Оз 11 Ою 10 
Qs0 11 Озо 8 
N, 9 Qso 7 
f, 6 гу 7 
N, 4 N, 7 
T* 3 K 6 
fy 1 N, 4 
U* 0 i 2 


Note: Frequency records the number of series whose test values are significant 
at the 5 per cent level. 


Returns from different series on the same day can be correlated. For 
example, there is low correlation for any pair of US stocks and any pair from 
lead, tin, and zinc, with very high correlation for any two futures series 
defined for the same commodity. This is not particularly serious for the 
stocks or metals. Test results for any two corn series are highly dependent 
and likewise for cocoa, coffee, and sugar. The correlation between test 
statistics is described further in Appendix 6(A). 

Many researchers have claimed the random walk hypothesis is false so 
our rejection of the hypothesis is not particularly surprising. Using the 
trend statistics makes the conclusion of non-random behaviour clearer for 
several series. Working with long series allows the rejection of random 
behaviour for every series. It is interesting to note which statistics reject the 
null hypothesis most often. Stock series and the other series are discussed 
separately as their stochastic processes are clearly different. 


Stocks 


Table 6.3 shows the number of times the random walk hypothesis is 
rejected, by test statistic, for the 16 stock series. Only r,Vn* rejects for 
every series. The high scores for the Q, statistics are entirely due to the 
dominant contribution of n*r? to the total n*(rj + --- + rk); a modified 
statistic n*(r5 + ++ + rz) rejects for very few series. The runs statistic Khas 
a high score because it is essentially a first-lag coefficient too (Section 6.4). 
Only six series show significant evidence for the weekly cycle caused by 
lower than average returns on Mondays (Section 2.5). The scores for all the 
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Figure 6.2 Autocorrelation coefficients for the stock American Telephone and 
Telegraph, shown for returns (+) and rescaled returns (x). Dotted lines indicate 
+1.96/Vn* 


trend statistics are very low. Reducing yin (6.8.2) from 0.04 to 0.02 changes 
the score for r,Vn* to 15 (the index series does not reject the null 
hypothesis), whilst for y — 0.08 the score is still the maximum possible. 

The autocorrelations of US stock returns have two interesting features. 
Firstly, there is significant positive autocorrelation at lag 1, also noted by 
Perry (1982) for more and longer series. It is unlikely that a trading rule 
could exploit the small autocorrelation. Secondly, the majority of co- 
efficients at higher lags are negative, shown previously by Fama (1965). For 
rescaled returns, 67 per cent of the coefficients are negative at lags 2 to 10 
inclusive, 54 per cent are negative over lags 11 to 30. For lags 2 to 10, 24 of 
the 135 coefficients are less than —1.96/V/n*; 6 of these 24 occur at lag 2. 

The coefficients r,, and the spectral estimates f(w) for AT and T are 
plotted onFigures6.2and6.3respectively. This stock has the median first co- 
efficient (0.098) for the 15 series. Coefficients r, x, calculated from returns, 
are also shown on Figure 6.2. It is clear that the greatest autocorrelation 
occurs at lag 1 and only at lag 1 are the coefficients highly significant (the 
dotted lines on Figure 6.2 indicate 1.96/\/n* and —1.96/\/n*). The greatest 
spectral density corresponds to a five-day cycle. This series is one of the six 
stock series having Á, significant at the 5 per cent level. 
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Figure 6.3 Estimated spectral density for American Telephone and Telegraph 

stock calculated from rescaled returns. Graph shows Қо) plotted against w/z. 

Estimates above the upper dotted line or below the lower line are significant at the 
5 per cent level 


Commodities and currencies 


It has already been noted that test results for any pair of futures series for 
the same commodity are similar. Therefore only one series per commodity 
is considered when counting the number of times each statistic rejects the 
random walk hypothesis. The series taking six-month blocks for prices 
from each contract have been chosen for corn, cocoa, coffee, and sugar. 
This block size avoids any thin trading more than six months before 
delivery. It also provides less biased coefficients than series using three- 
month blocks; bias occurs because there are less than n* — 7 products in 
the numerator of r, , for futures series. 

Fifteen series are now discussed, covering metals (six spot series), farm 
products (five futures series) and currencies (one spot and three futures 
series). Table 6.3 lists the number of times the random walk hypothesis is 
rejected, by test statistic, for the 15 commodity and currency series. The 
trend statistics outscore all the other statistics. Both 7* and f, reject for all 
15 series, U* rejects for 14 and the next statistic only rejects 10 times. The 
first-lag statistic and its non-parametric alternative, the runs statistic, only 
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Figure 6.4 Autocorrelation coefficients for gold returns (+) and rescaled 
returns (X) 


score 7 and 6 respectively. There is minimal evidence for weekly (or any 
other) cycles; f, scores 2. Other values of y in (6.8.2) give similar results. 
For y = 0.05, 7* and U* both score 14, for y = 0.2, T* scores 15 and U* 
scores 13. 

Clearly trend statistics are best for tests on commodity and currency 
series whilst first-lag statistics are best for stock series. Different models 
are obviously needed for the two types of series. Models having positive 
autocorrelation at several lags are indicated for commodities and curren- 
cies. About 81 per cent of the coefficients at lags 1 to 10 are positive for the 
15 series, 77 per cent positive over lags 11 to 20 and 63 per cent positive over 
lags 21 to 30. This preponderance of positive coefficients suggests that 
trend models, with p near to unity, will be close to the unknown process 
generating the observed series. 

Figures 6.4, 6.5, and 6.6 show the autocorrelations г, , for gold, sugar, 
and £/$ futures; the r, , are also plotted. The first twenty sugar coefficients 
г.у are all positive. Every statistically significant г,, on these figures is 
positive. Notice that the r, , and r, , can be quite different relative to the 
scale used. Also notice the higher variance of the r, ,, especially on Figure 
6.4. Figures 6.7, 6.8, and 6.9 display the spectral estimates f(w) for the same 
three series. Each figure shows a sharp peak at zero frequency. 
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Figure 6.5 Autocorrelation coefficients for sugar futures returns (+) and rescaled 
returns (X) 
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Figure 6.6 Autocorrelation coefficients for sterling futures returns (+) and 
rescaled returns (X) 
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Figure 6.7 Estimated spectral density for gold calculated from rescaled returns 
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Figure 6.8 Estimated spectral density for sugar futures calculated from rescaled 
returns 
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Figure 6.9 Estimated spectral density for sterling futures calculated from rescaled 
returns 


Table 6.2 can be used to see if dependence between futures returns is 
greatest when the contract still has several trading months remaining, 
perhaps due to thin trading. The test results show that dependence is not 
notably greater for series using twelve-month blocks than it is for series 
using six-month blocks. The same conclusion holds when six- and three- 
month blocks are compared. Autocorrelation among futures returns is not 
caused by thin trading several months before the delivery date. 


About the rest of this chapter 


The results of random walk tests applied to rescaled returns show that 
stock, commodity, and currency prices are not random walks. Thus we 
have established that daily returns are not uncorrelated observations from 
distributions having identical means. The next section discusses results 
from applying tests to the returns instead of the recommended rescaled 
returns. These results also show that prices are not random walks. The ease 
with which the random walk hypothesis is rejected is partly due to using 
long series and partly due to choosing an appropriate test statistic. This is 
demonstrated in Section 6.10. 
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Expected returns need not be constant even when prices perfectly 
reflect relevant information. A more general null hypothesis is investigated 
in Section 6.11 based upon returns having expected values determined by 
someappropriate equilibrium theory. Testsofthe more generalhypothesis 
will be shown to give very similar results to random walk tests. It then 
follows that some relevant information is reflected in prices during the 
days following its first impact upon the market. 

Sections 6.11 and 6.12 show that the apparent inefficient use of informa- 
tion extends beyond calendar effects and is not caused by market rules 
restricting prices nor by dealers buying and selling at different prices. 
Figures given in Section 6.13 suggest the inefficiencies occurred through- 
out the full period of the long time series. 


6.9 SOME TEST RESULTS FOR RETURNS 


It was shown in Chapter 5 that tests based upon the autocorrelation 
coefficients of returns will be unreliable because the variances of the 
coefficients exceed 1/n when the random walk hypothesis is true. Now let 
Т? and Ту respectively denote 7% when coefficients г, , and г, аге used. А 
very surprising conclusion is revealed when the tests are evaluated using 
the coefficients г, ,: for each of the 40 series Т; is less than 7), often by a 
considerable amount. Table 6.4 shows the values of T7, T;, and Ту — Т. 
The number of random walk rejections, atthe 5 percent level for T7, is only 
6 out of 15 for the commodity and currency series. A majority of series and 
lags have г,, > fx; see Table 6.1 for first-lag coefficients. The rejections 
count for r, V n is 14 out of 16 for the stocks series. 

Considering random variables R, хапа К, у, it appears true that E[R, x] < 
E[R, y] and this explains the observed conclusion 7; < Ту. The observation 
Tt < Ту forall series is itself evidence against the random walk hypothesis. 
Random walks have been simulated to confirm this deduction. One set of 
40 series, each containing 2020 returns, was created by simulating Gaussian 
white noise. The mean difference г), — r;, was 0.0002 and the mean for 
Ty — TY was 0.06 with T% > TY for 13 series. Two values of T* and three 
values of T£ were significant at the 5 per cent level. Another set of 40 series 
had fluctuating conditional variances with X, = V,U,; {log (М,)} was Gaus- 
sian with variance 0.25 апа autocorrelations 0.985 ‘and (U,) was Gaussian 
white noise. Again there were 2020 simulated returns in each series. The 
mean for т, — ri, was —0.0055, the average of T£ — Ту was —0.13 and 21 
series had 7; > Ту. It can safely be concluded that real price series are not 
random walks. Two values of 77 and six values of Т? exceeded 1.65 in the 
second set of simulations. Of course, the standard deviation of T* is more 
than one for these simulations; the estimated value was 1.37. The ry were 
calculated using y = 0.1 in all the simulations. 


158 Modelling financial time series 


Table6.4 Trend statistics for returns and rescaled returns 


T: T: T — Tš 

Spotseries 

Allied —0.06 0.73 0.79 
Alcoa 1.72 2.87 1.15 
Am. Can 0.87 1.84 0.97 
ATandT —0.17 0.19 0.36 
Am. Brands —1.89 —0.74 1.15 
Anaconda —1.30 —0.31 1.00 
Bethlehem —0.62 —0.26 0.36 
Chrysler —0.64 —0.03 0.60 
Dupont 2.23 3.22 0.99 
Kodak —1.03 —0.42 0.61 
G. Electric 0.15 1.24 1.08 
G. Food 0.41 0.79 0.38 
G. Motors —0.86 —0.23 0.63 
G. Telephone —0.45 —0.28 0.17 
Harvester 0.22 0.89 0.67 
FT30 0.84 1.41 0.57 
Gold 0.58 4.73 4.14 
Silver 0.19 1.94 1.75 
Copper 1.38 4.03 2.65 
Lead —0.74 3.82 4.57 
Tin —0.32 2.24 2.57 
Zinc —0.87 2.12 2.99 
£/$ 0.91 6.56 5.65 
Futures series 

Corn (12) 1.67 1.89 0.22 
Corn (6) 2.96 3.03 0.07 
Corn (3) 2.78 3.29 0.51 
Cocoa (12) 3.22 6.65 3.44 
Cocoa (6) 2.94 6.34 3.40 
Cocoa (3) 4.03 6.43 2.40 
Coffee (12) 2.17 6.96 4.80 
Coffee (6) 2.85 6.67 3.82 
Coffee (3) 2.01 6.20 4.18 
Sugar (12) 4.58 7.87 3.29 
Sugar (6) 5.62 8.97 3.35 
Sugar (3) 5.49 9.26 3.77 
Wool (12) 1.12 2.26 1.14 
£/$ (6) 1.38 3.53 2.15 
DMY/$ (6) 1.83 4.83 3.00 
SF/$ (6) 2.72 3.01 0.30 


T-bond 0.13 0.46 0.33 
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A fairly satisfactory explanation of the positive differences Ту — Ту will 
be provided in Chapter 7. Institutional factors cannot suffice as the positive 
differences occur for a diverse set of markets. The rescaling transformation 
from x, to y, must play some part in an explanation; the positive differences 
increase as y increases for all the stocks but only half the other series. The 
most likely explanation is that the non-random component of returns is 
relatively large when the conditional standard deviation is relatively small. 
This idea will be explored for trend models in Sections 7.3 and 7.4, and 
there shown to provide a possible explanation for the commodity and 
currency series. 


6.10 POWER COMPARISONS 


The evidence for the claim that none of the series is a random walk is 
very strong. Table 6.3 shows the relative power of the test statistics if 
the claim is accepted. Clearly choosing an appropriate test increases 
power. Test power will increase as the number of observations increases. 
These properties of test power are now illustrated by numerical 
examples. 

Firstly, following Taylor (19826, pp. 55-6), consider test power when the 
price-trend hypothesis (H1) is true. The trend statistics ought then to have 
higher power than other statistics, just as the converse would hold for 
other types of non-randomness, for example a first-order moving average. 
Quantification of the power gained when H, is true can help to show if the 
observation that 7% rejects the random walk hypothesis more frequently 
than r;, etc., can be used to support trend models as an explanation of price 
behaviour. 

Returns were simulated for the following model consistent with 
Hi: X, = ш + &, the trend д, is either u; with probability р or a new 
independent value with chance 1 — p, var (ш) = A var (ej) and the e, are 
strict white noise. Series of 1500 returns were simulated with A — 0.034 and 
p = 0.944. The model was twice replicated 1000 times with the e, first 
normal distributed and second distributed as Student's t with six degrees 
of freedom. The trends 4, had normal distributions. Independent realiza- 
tions of identical distributions were used, so the rescaling transformation 
was neither required nor used. 

Tests on weekly and monthly data are often reported. To compare their 
power with tests performed on daily data, returns over one week and four 
weeks were computed in the obvious way and then the first lag coefficients 
rw and л.м were calculated. For trend models, the theoretical auto- 
correlations p, of returns aggregated over k days are related to the daily 
first-lag autocorrelation p; and m — 1/(1 — p) thus: 
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For the simulations, p, = 0.032,p,s = 0.115, p; ж = 0.169 and the expected 
values of the test statistics Vnr,, V(n/5)r, уу, and V(n/20)r, m are approxi- 
mately 1.24, 1.99, and 1.46. 

Table 6.5 lists the estimated powers of the various tests for a 5 per cent 
significance level, when A = 0.034, p = 0.944, and n = 1500. The powers 
for the two distributions are very similar. The trend statistics T*, U* and f, 
are far more powerful than their competitors for the realistic values of A, p, 
and n. Thus the higher scores of the trend statistics in Table 6.3 for 
commodity and currency series are indeed evidence that price-trend 
models are appropriate for such series. As expected, the best first-lag 
coefficient for detecting trend behaviour uses weekly returns. 

Secondly, consider test power as a function of the number of returns. 
The test statistic rn „Vn* has been calculated for the first n* = 250, 500, 
1000, and 2000 rescaled returns in each of the US stock series. Four of the 
fifteen test values are significant at the 5 per cent level when n* = 250, 
increasing to six significant when n* = 500, eight significant when n* 

= 1000 and fourteen significant when n* = 2000. The share index series 


Table 6.5 Estimated powers for 1500 observations from a 
trend model 


Percentage rejections of the random 


walk hypothesis 
Statistic Normal residuals Student residuals 
T* 87 86 
U* 84 85 
fy 78 77 
hw 57 61 
Ою 46 47 
Ою 38 39 
гум 35 34 
г 33 34 
О 31 32 
K 26 36 
Nc 7 8 
f, 3 4 


Notes: Trend parameters À = 0.034, p = 0.944. Significance level 
5 per cent. 

Reprinted with permission from the Journal of Financial and Quan- 
titative Analysis (Taylor, 1982b). 
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gives asignificant result for n* 2 500. Likewise the test statistic T£ has been 
calculated for subsets of the commodity and currency series. Contracts 
covering one, two, four, or eight years were used for the futures series. For 
the 15 series discussed in Section 6.8, only one value of Ту is significant 
when n* = 250, five are significant when n* = 500and nine are significant 
when n* = 1000. The statistics are significant for every series having more 
than 2000 observations. 

These results show that it is highly desirable to study series containing at 
least 2000 returns. A study of several shorter series, say 1000 returns in 
each, will probably give some significant and some non-significant test 
values. It would then be difficult to state clear conclusions. 


6.11 TESTING EQUILIBRIUM MODELS 


So far the null hypothesis has been defined to include the joint assump- 
tions that returns are uncorrelated and have a constant mean. When this 
hypothesis is true the conditional expected returns, F[X,|/, 11, will in 
practice depend neither on the time t nor upon the observed past returns 
11. Random walk tests implicitly assume a model of market equilibrium 
for which the equilibrium expected return is constant. Such a model is an 
approximation to realistic theories of equilibrium expectations. The 
approximation is usually considered satisfactory when testing the 
hypothesis that prices perfectly reflect available information (Fama, 1976, 
Ch. 5, especially pp. 149-51). 

Let /*_, beaset representing information used at time t — 1 to establish 
equilibrium expectations for the next day. The set /*_, need not contain 
1,1. Also let u, denote the equilibrium expected return, namely Е[Х,| 1% 4], 
and define the excess return to be e, = X, — m. Теп excess returns will be 
uncorrelated if a market uses information perfectly but returns would then 
appear to be slightly autocorrelated if the u, change with time. It is now 
shown that any such autocorrelation among returns is very small and can 
usually be ignored. 

In certain equilibrium models и, and E[X,| /, |1 are always non-negative. 
The price process (Z,) is then a submartingale since Е[2,|2;_1, 2,2, ...] > 
2—1. 


Stocks 


The standard equilibrium model for stock returns is the capital asset 
pricing model (CAPM) attributed to Sharpe (1964) and Lintner (1965) and 
explained by Fama (1976) and Brealey and Myers (1984) among others. The 
model relates the equilibrium expected return for a particular stock to the 
same quantity for the whole market: 
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In this equation, Xn is the market return, B, measures the stock’s risk and 
Ta is the risk-free rate of interest; the model has been stated for conveni- 
ence in nominal terms. Let г, be the term in curly brackets, called the risk 
premium. The beta-coefficient B, can change with time and its weighted 
average over all stocks is unity. To simplify the discussion suppose 8, = 1 
forall t. Thisis notaserious simplification forthe following argument. Then 
the return X, conditional upon /7 4 = (ra, гы, Be} is 


XP = gu + e, = ra + Tot + 6. (6.11.1) 


Itis a matter of simple observation that interest rates г, change with time. 
Financial theory requires г, to be positive but does not assume the risk 
premium is constant, although measuring changes in the premium ís very 
difficult (Merton, 1980). If we are willing to model the equilibrium expected 
returns by a stationary process, then perfect market efficiency implies 


cov (e,, е,,;) = 0, т> 0, 
апа 
COV (Xe, Xtar) = COV (ш, ш-)- 


Hence the autocorrelations of a stationary returns process for a perfect 
efficient market are 


Prix = COV (шщ, pus var (Xj). 


In these circumstances an upper bound p* for the autocorrelations is 
given by 


ІР-х| < var (u,)/var (X) = p*. (6.11.2) 


Given ш, the equivalent expected return in annual terms is 
exp (2504, + 12507) — 1 assuming e, ~ N(0, о?) and 250 trading days іп а 
year. Over a period of ten years it would be likely that the range for annual 
expected returns was less than 20 per cent, especially for returns before 
1980. The corresponding range for w, say и’ to u”, will then be constrained 
ру 250(и” — ш’) < log (1.2). A reasonable bound for var (w) is given by the 
variance of a uniform distribution over the interval и’ to u”, namely 
(и” — u'Y/12. A typical stock has returns variances (0.015)? (Section 2.4). 
Thus one estimate of the upper bound in (6.11.2) is 


ô = [log (1.2)/250]2/[12(0.015)2] = 0.0002. (6.11.3) 


If we follow Merton (1980) and postulate a relationship г, = ау! for 
some j, v, being the conditional standard deviation of e,, then p* сап be 
calculated exactly from the distribution of V, if r, is assumed constant. 1 
have considered log (Vj) ~ N(—4.4, 0.452), cf. Section 3.9, and assumed [ot 


Testing the random walk hypothesis 163 


corresponds to a 10 per cent annual premium, approximately, when v, 
equals the median value of its distribution. Then r,, = 0.025v, gives р* 
= 0.0001 whilst r,, = 1.6у? gives p* = 0.0007. 

Clearly if stationary processes are assumed it is impossible for day-to-day 
changes in the equilibrium expected return to be the cause of important 
autocorrelation among stock returns. The variation in и; is trivial compared 
with the variation in excess returns e,. Other equilibrium models, such as 
the arbitrage pricing theory investigated by Roll and Ross (1980), will give 
the same conclusion. Returns on market indices compared with returns on 
individual stocks have similar risk premia but smaller variances and hence 
p* is higher. 


Simulation results 


Computer simulations confirm that changes in equilibrium expected 
returns have little effect upon the sample autocorrelations of returns and 
rescaled returns. Three sets of 40 random walks have been simulated and 
statistics such as rj x, љу, Tx, and T; calculated. Returns in each series have 
had hypothetical equilibrium terms и, added to them and then the statistics 
have been recalculated. Notation like Ar; x will represent the statistic ғ), for 
a series {д, + e} minus the statistic for the series {e,}. Differences Ar, ,, 
AT¥ etc., measure the effect of the time-dependent expected returns. 

All the simulated series have 2020 returns with 2000 used in all tests. The 
simulated processes have average returns variances equal to 0.0157, The 
first set of random series are Gaussian white noise. Set two contains series 
whose variance multiplies by four after half the observations, each half 
being Gaussian white noise. The third set is an appropriate product 
process with parameter values based on Tables 3.2 and 3.3. Separate 
random numbers were used for each series. 

Firstly, a long trend in equilibrium expected returns was investigated, 
defined by д, = aot with a chosen to make the expected annual return 
increase from 0 to 20 per cent over eight years. Table 6.6 lists the average 
values of Ar,,,Ar,,, АТ, ATS, and A(T% — T¥) andalso gives the standard 
deviations in brackets, separately for each set of 40 series; standard errors 
for the means are estimated by the standard deviations divided Бу V/40. All 
rescaled returns were obtained using y — 0.1. Secondly, the risk premium 
was set proportional to the conditional standard deviation of returns. 
Assuming a constant risk-free rate, д, = rf + ауу, for some number а). 
Following the argument given after equation (6.11.13), a, was chosen to be 
0.025. Table 6.6 summarizes the results for the second and third sets of 
series. As the first set has constant v, and hence constant д,, no new results 
are then obtained. Thirdly, the equilibrium expected returns were 
assumed to be a constant (rf) plus a;v?, with a; = 1.6; again see Table 6.6 
for a summary of the results. 


Table 6.6 Summary of simulation results for various expected returns models 


Model? for y, Set? Ағ, Any AT: AT? AT; – Tz?) 
at 1 0.0004 “0.0006% ^ 0.0004(0.0006) 0.07(0.12) 0.08(0.12) 0.01(0.03) 
2 0.0002 (0.0005) 0.0002(0.0006) — 0.05(0.11) 0.06(0.13) 0.01(0.05) 
3 0.0004(0.0005) 0.0007(0.0007) 0.09(0.11) 0.13(0.15) 0.04(0.11) 
rf + ayy, 2 0.0001(0.0002) 0.0001(0.0003) 0.02(0.05) 0.03(0.07) — 0.01(0.03) 
3 0.0004(0.0007) 0.0005(0.0007) 0.07(0.13) 0.09(0.12) 0.02(0.08) 
rf + av 2 0.0003(0.0004) 0.0004(0.0006) — 0.06(0.09) 0.08(0.13) 0.02(0.05) 
3 0.0016(0.0036) 0.0011(0.0015) — 0.28(0.70) 0.21(0.28) -0.08(0.48) 
Day effects 1 —0.0031(0.0028) — —0.0029(0.0027) 0.01(0.09) 0.00(0.09) -0.01(0.04) 
2 0.0003(0.0040) -0.0003(0.003) — 0.01(0.10)  —0.04(0.10) —0.05(0.09) 
3 —0.0028(0.0022) — —0.0045(0.0033) 0.07(0.11)  —0.03(0.12) -0.10(0.10) 
Month effects 1 0.0009(0.0009) 0.0007(0.0012) — 0.10(0.15) 0.10(0.16) — 0.00(0.06) 
2 0.0012(0.0015) 0.0019(0.0019) — 0.14(0.19) 0.28(0.23) 0.15(0.14) 
3 0.0005(0.0009) 0.0020(0.0018) 0.04(0.11) 0.25(0.23) 0.21(0.23) 


Defined іп Section 6.11. 


^ Conditional variances are constant for Set 1, have a step change for Set 2, and follow an autoregressive, lognormal process for 


Set 3. 


“Sample mean for a sample of size 40. 
?Sample standard deviations are bracketed. 
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The results from these simulations show that changes іп д, indeed 
induce negligible autocorrelation for almost all series. When и, depends 
on w and high values occur for v, there can be more substantial induced 
autocorrelation. For the third set one series had Аг), = 0.0199, 
Ал, = 0.0073, АТ; = 4.17, and AT; = 1.47, the next largest differences 
were far smaller at 0.0093, 0.0036, 1.38, and 0.51, whilst four of the 7% with 
changing д, were significant compared with two significant values for 
constant и,. 

Further simulations have incorporated the calendar effects described in 
Section 2.5. These time-dependent expected returns represent market 
imperfections rather than equilibrium expectations, in the absence of a 
satisfactory equilibrium theory. Firstly, ми, for Mondays minus д, on any 
other day was defined to be —0.0023 and the standard deviation of e, was 
multiplied by 1.2 on Mondays, whilst keeping the overall average variance 
at (0.01577. These simulations provide the fourth group of results in Table 
6.6. Secondly, џи; іп January minus шіп any other month was defined to be 
0.0018. This gives the fifth group of results in Table6.6. The table shows that 
calendar effects induce more measured autocorrelation than can be at- 
tributed to a plausible equilibrium model. As predicted theoretically, it is 
seen that adding US calendar effects to an uncorrelated series makes the 
first-lag coefficient negative (positive) on average when only weekly 
(monthly) effects are considered. Combining the effects gives an expected 
negative first-lag autocorrelation. 

The final column of Table6.6 shows that none of the models for expected 
returns can explain the large positive differences 7% — Tz observed for the 
40 real series. Monthly effects could explain some but only a small fraction 
of the positive differences. 


Tests 


Let Hoe denote the null hypothesis that there is zero autocorrelation among 
excess returns fora particular model of market equilibrium and/or calendar 
effects. Tests of Hoe Сап be constructed by using an estimate ó of the upper 
bound p* appropriate to the null hypothesis and then altering standard 
random walk tests. For example, the test based on л, , would reject Ho, at 
the 5 per cent level if 


[ri Vn*| > 1.96 + руп“. 


The adjustment added to 1.96 is only 0.02 if n* = 2500 апар = 0.0004. This 
would be reasonable if calendar effects are ignored. Even if ó was as high as 
0.004 the statistic r,, V/n* would reject Hoe for all the stock series. 
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Similarly a test based on T* rejects Hoe if 


30 
Tt > 1.65 + 0.42749 /n* У 0.927 = 1.65 + 4.516Vn*. 
T=] 
The term added to 1.65 would usually be small although the required 
adjustment to 77 is greater than that to ЖАЛЫ 


Other equilibrium models 


At a theoretical level and assuming perfect markets, it is plausible to 
assume futures have zero equilibrium expected returns (Section 2.3). 
Returns from holding commodity inventories ought to depend on at least 
interest rates and perhaps also risk premia and convenience yields. Tests of 
Moe for spot commodity returns thus require a relevant estimate of p*. As 
the variances of such returns are greater than for stocks, one possibility is 
to use the stocks estimate ignoring calendar effects, i.e. 0.0004. Then Hoe is 
rejected for all six metal series using an adjusted ТУ and for all six excepting 
tin using an adjusted Ur. Another possibility is to be very cautious and 
allow for unknown calendar effects too. There are at present no results 
linking metal returns to the calendar but choosing p* — 0.003 looks safe 
from Table 6.6. Then three of the six series would reject Hoe using an 
adjusted T% and five still reject using an adjusted у. 

Equilibrium expected returns from spot currencies will depend on 
interest rates. Suppose for example the relevant one-day rates are , and 
ін, for deposits in the US and the UK from time t — 1 to t and let z, be the 
number of dollars sold for one pound on day t. Adopting our usual 
definition of the return x, from holding an investment, i.e. the change in 
price logarithms, the equilibrium expected return will be approximately 
lc, — А. The approximation is almost perfect if spot traders are risk-neu- 
tral or there is no risk premium. The interest rate differential is continually 
changing. Its range over a decade is likely to be much less than the 20 per 
cent assumed to derive the upper bound (6.11.3) for stocks. However, the 
returns variance for stocks has been about six times as large as for 
currencies. A satisfactory upperbound for most currencies will be six times 
the figure іп (6.11.3), i.e. 0.0012. An adjusted Ту or U% will then comfortably 
reject Hoe at the 5 per cent level for the spot £/$ series. 


Conclusion 


Subtracting equilibrium expectations and calendar effects from returns 
has little effect upon tests for randomness. The weak dependence between 
returns claimed earlier is not caused by the variables considered in this 
section. 
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6.12 INSTITUTIONAL EFFECTS 


Many futures exchanges limit the amount prices can change from day to 
day. Dealers at spot and futures markets quote separate bid and ask prices. 
These institutional features prevent markets determining one price reflect- 
ing all relevant information. We now consider the consequences for 
random walk tests. 


Limit rules 


Various US futures contracts cannot be traded at prices differing by more 
than a certain amount from the previous day's close or settlement price. 
The IMM rules governing sterling futures for the period studied in this text, 
1974 to 1982, prohibited trading at a price $0.05 above or belowthe previous 
day's settlement with two exceptions: wider limits are defined after con- 
secutive limit days and there is no limit in the weeks just before delivery. 
Only 10 of the 1991 sterling settlement prices are constrained by the limit 
rules and no consecutive limit days are observed. Roll (1984) describes limit 
rules for orange juice futures and finds limit moves are relatively common. 
One or both of two contracts moved the limit on slightly over 10 per cent of 
the trading days in his sample. 

UK futures have less exacting rules. Cocoa trading, for example, stops 
for half-an-hour after the price moves more than a fixed amount from the 
previous close but after the break prices are no longer restricted. Sugar 
trading is similar and coffee prices are never limited. Thus UK futures 
prices at the close are very rarely restricted. 

Limit restrictions will cause positive autocorrelation among returns 
when information is used perfectly. When a market closes limit-up, sub- 
sequent prices must include the upward movement prevented on the limit 
day. A high positive return on the limit day will be followed by one or more 
returns having higher than average (conditional) expected values. Similar 
dependence occurs for limit-down days. 

Simulations can give approximate values for the autocorrelation induced 
by limit rules. The results can only be approximate because it is impractical 
to simulate intra-day price changes. A simulated random walk defines a 
series of perfect market prices (2%). This series defines market limited 
prices {z,}, for some limit parameter 0, by: 


2-27 if (1 + 0)z,-, > z* > (1 — 0)2, |, 
=(1+6)z,,  ifz* > (1 + 062,1 (6.12.1) 
= (1 — 0z.4 if zt < (1 — 8)z, 4. 


Keeping 0 constant for all t is convenient and unlikely to affect the 
conclusions. Clearly intra-day price movements are ignored: for example 
if 0 = 0.04, z$ = 100, and z* falls all morning to 95 but closes at 21 = 105, 
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Table6.7 Summary of simulation results for a simple limit rule 


Average values of 


Percentage 
Ao limit days An. Any AT; АТУ 
Set? 
1.5 14.6 0.10 0.09 2.10 1.83 
2 4.9 0.04 0.03 0.66 0.56 
2.5 1.3 0.01 0.01 0.17 0.13 
3 0.3 0.00 0.00 0.03 0.03 
Set2 
1.5 10.6 0.14 0.07 4.59 1.82 
2 5.2 0.09 0.03 2.59 0.86 
2.5 2.7 0.06 0.02 1.57 0.44 
3 1.5 0.04 0.01 0.99 0.24 


Notes: Conditional variances are constant for Set 1 and follow an autoregressive, lognormal 
process for Set 2. Percentage limit days refers to days when the market limited price is not the 
perfect market price. 


then itis seen that z; — 104 whereas the correct market limited price is 96. 

Zero risk premia were assumed in the simulations. Then, if simulated 
random returns х? come from a stationary process having variance c^, the 
autocorrelation induced by limit moves depends on the ratio 8/o. Two sets 
of forty series (zz) have been simulated. As in Section 6.11 each series 
gives 2020 returns. The first set has Gaussian white noise returns. The other 
set has substantial fluctuations in conditional standard deviations V,, the 
returns being uncorrelated with log (V,) having standard deviation 0.6 and 
autocorrelations 0.9857. 

Table 6.7 summarizes the induced autocorrelation as a function of б/о. 
Notation like Ar, , refers to r,, for market limited returns minus n, for 
perfect market returns. Appropriate ratios 6/о have been between 1.5 and 
2 for orange juice and more than 3 for currencies. From Table 6.7 it is 
suggested that limit moves need to be considered carefully if more than 2 
per cent of the prices are restricted by limit regulations. 

It is clear, as should be expected, that limit rules induce far more 
autocorrelation among returns than among rescaled returns when con- 
ditional variances fluctuate. Therefore the significant autocorrelation 
claimed for the futures investigated in this book cannot be caused by limit 
rules because their returns are less autocorrelated than their rescaled 
returns. It appears that limit rules do not affect the random walk tests for 
the markets studied here. However, as shown by Roll (1984), limit rules at 
other markets need to be investigated in more depth. 
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Bid—ask spreads 


Brokers and dealers quote different buying and selling prices. The differ- 
ences are very small for a high volume asset. For most futures the bid-ask 
range isless than 0.1 per cent of an average price on most days. The range 
for stocks is generally higher in percentage terms. In Section 2.1 it was 
recommended that the average of the bid and ask prices is used to define a 
closing price. If, however, only one of bid and ask is available to a 
researcher then some spurious autocorrelation may occur. This is now 
claimed to be negligible. 

Firstly, suppose the analysed price z, is either a bid or ask price but it is 
not known which it is. Let z be another way to measure the closing price, 
perhaps the bid and ask average or perhaps some equilibrium price. Then 
let ó, be defined by z, = (1 + ó,)z*. It is appropriate to assume the ó, are 
uncorrelated with zero mean. The theoretical autocorrelations of returns x, 
calculated from the z, will be lower than those for returns x7 derived from 
the 2%. In particular, for stationary processes and uncorrelated X7, the 
spurious autocorrelation is approximately 


pix = --уаг (6,)/var (X), Pr,X = 0 for т> 1. 


Considering futures, when |ó,| << 0.001 and var (X,) 2 (0.002)?, the first-lag 
autocorrelation is between 0 and —0.0007. 

Secondly, suppose z,is always the selling price so that every 6, is positive. 
Making reasonable assumptions about the ó, any spurious autocorrelation 
will be zero or negative. These assumptions, for a stationary process {6,} 
having autocorrelations p,, are p, 4 — p, > p, — p,44 > 0 for all positive т; 
a satisfactory example is the autocorrelations $”, ф > 0. 

Bid-ask spreads are most unlikely to cause spurious positive autocorrela- 
tion. Therefore such spreads cannot explain the significant positive values 
of the random walk test statistics. 


6.13 RESULTS FOR SUBDIVIDED SERIES 


The results presented in this chapter show that some information is not 
reflected correctly by prices as fast as it could be. There have been 
considerable advances in communications technology in recent years so 
we might expect the amount of dependence between returns to decline as 
time progresses. This possibility is now investigated by comparing test 
statistics for the two halves of each series. 

Dependence between stock returns was detected using the first auto- 
correlation coefficient, гу, calculated from rescaled returns. This co- 
efficient has also been calculated from each half of the fifteen stock series, 
in every case using 1365 rescaled returns. All thirty coefficients are positive. 
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Twelve are significant at the 5 per cent level for the first half (1966 to 
mid-1971) and fourteen are significant for the second half (mid-1971 to 
1976). For nine of the fifteen series the coefficient for the later period 
exceeds that for the earlier period. There is no evidence that the depen- 
dence decreased after 1971. Coefficients ғ), calculated from returns pro- 
vide identical conclusions, all thirty again being positive. 

A rank correlation test ranks the values of r; to see if stocks with high 
dependence in the first period have higher than average dependence in 
the second period. The correlation between the ranks of r,, in the two 
periods is 0.850 and for the ranked r, it is 0.543. Both rank correlations are 
significant at the 5 per cent level (one-tail test). It thus appears that certain 
stocks consistently have higher than average autocorrelation. Alcoa had 
the highest r, for both halves whether calculated from returns or rescaled 
returns. 

Dependence among commodity and currency returns was demonstrated 
using the trend statistic 77. This statistic has also been evaluated for each 
half of the appropriate series. For the six metal, five agricultural, and four 
currency series discussed in Section 6.8 it is found that 13 out of 15 have Ту 
significant at the 5 per cent level for the first half with 11 significant for the 
second half. All the metal test values are higher in the first half than the 
second. There is not a clear pattern for the agricultural futures and the 
currency series. Coffee and sugar statistics are slightly higher for their first 
halves, corn and cocoa give very similar values for both halves, and the 
currency series split evenly between higher values in the first and second 
halves. It is noted that 7* for half a series is, with one exception, always 
greater for rescaled returns than for returns. 


6.14 CONCLUSIONS 


Prices did not follow random walks at the markets considered. Daily 
returns calculated from prices were not generated by some uncorrelated 
process having a constant mean. These conclusions have been established 
for many actively traded stocks, commodities, and currencies so itis likely 
that they apply to almost any financial asset. 

More generally prices do not reflect information fully and instantane- 
ously and thus markets are not perfectly efficient. This conclusion follows 
from Section 6.11 and the inefficiencies extend beyond calendar effects 
and institutional constraints. Of course the statistical dependence iden- 
tified by random walk tests is very small and therefore prices reflect most 
information accurately and fairly quickly. Nevertheless it appears some 
information is not used as well as it could be. 
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Daily returns from US stocks on consecutive days are positively corre- 
lated, also noted by Perry (1982) and others. The first-lag autocorrelation 
coefficient is small and usually in the range 0.05 to 0.15. There is no 
evidence for longer term trends but, instead, there may be a tendency 
towards negative dependence at time lags of more than one day. The 
logical ARMA model for US stock returns is a first-order moving average, 
MA(1), although this cannot represent any dependence at lags beyond a 
single day. Forecasts derived from MA(1) models could be an input to a 
trading rule for stocks. However, any ordinary citizen using such a rule is 
certain to incur trading costs far greater than any gross profits (cf. Fama and 
Blume, 1966). This concludes our discussion of US stocks: the markets are 
efficient according to any practical definition (e.g. Jensen, 1978) but prices 
do not always correctly reflect relevant information on the day it becomes 
available. 

Contrasting results are obtained for commodities and currencies. The 
statistical dependence among daily returns occurs at severallags, therefore 
some information takes several days to be fully reflected by prices. An 
explanation based upon price-trends is consistent with the sample auto- 
correlations, a majority of them being positive. Appropriate theoretical 
autocorrelations are those of a mixed autoregressive, moving-average 
process, ARMA(, 1). Price-trend models are described further in Chapter 
7 and suitable forecasts of future returns are assessed. These forecasts can 
be used to help construct trading rules which may provide futures traders 
with consistent net excess profits. Some trading results are presented in 
Chapter 8. 

Two methodological conclusions are emphasized. Firstly, random walk 
tests must use rejection criteria which are valid for the null hypothesis 
considered. Hence, following the arguments given in Chapter 5, the 
parametric tests have been evaluated using rescaled returns. Results are 
similar when returns are used but their correct interpretation is unclear. 
For some reason rescaled returns display more dependence than returns. 
A number of elementary explanations have been explored unsuccessfully. 
Further efforts follow in Chapter 7 and are more successful. 

Secondly, some test statistics are substantially more powerful than their 
competitors for the detection of certain autocorrelation structures. It is 
recommended that r, is used for random walk tests on stock data and a 
trend statistic is used for commodities and currencies. The statistic 7* 
involves subjectively chosen constants and if this is unacceptable the 
zero-frequency statistic f can be used to test for the trend alternative. 
Other test statistics give poor results forthe series tested here. In particular 
the runs test cannot be considered satisfactory because it uses relatively 
little information, see Section 6.4. 
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Interesting results can be obtained from 1000 rescaled returns if a 
powerful test statistic is chosen, however longer series should be used if 
possible. Large returns should be checked against a secondary source if 
one is available and then the tests will be more reliable, see Section 6.7. 


6.15 SUMMARY 


The random walk hypothesis is false for every long series tested. Stock 
prices can take a day longer than necessary to reflect information correctly. 
Itis believed this informational inefficiency cannot be exploited. Commod- 
ity and currency prices have reflected relevant information slower and the 
efficiency of their markets will be investigated in Chapter 8. 


APPENDIX 6(A) CORRELATION BETWEEN TEST VALUES FOR TWO 
RELATED SERIES 


Suppose (Y; ,} and (Y;,j are stationary, cross-correlated processes, their 
interdependence being entirely contemporaneous so that 


cor (Yis, Y) = A if s — t, 
= 0 otherwise, 


for some А # 0. Without loss of generality it is assumed each process has 
mean zero and variance one. There will be some correlation between Ri, 
and К, ,, defined here by 


Ro E YY |B Yin jun 
= t=1 
Define A,, B,, and C, by 


1 aT ЖЕСТ 
A, = n > > МЕ МЕН У.т, 
ЕЕЕ 


n 1-4 
В, - > Үз апа C, = n > ere 
t=1 


зз 


so that nR; „Rz, equals A,/(B, C,). 
Assuming each process is strict white noise, as n increases: 


E[A,] > E[Yi Yi ns Y; t Y2 tar] TE А5 
B,—51 апа С,->1, 


hence 
nE[R, ,R,,] > A? 


and 
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cor (R,,, R, ,) — А“ also. (6.A.1) 


A similar argument shows there is no asymptotic correlation between 
R, „ апа R, whenever т 7 £. It then follows that for any constants a, the 
linear combinations L, and L, defined by 


L = > 4€. 
т>0 
have correlation A? too. The trend statistics 7%, U*, and f, provide relevant 
examples. 

The result (6.4.1) should be a good approximation if the processes are 
uncorrelated rather than strict white noise. Estimates of À are low for either 
returns or rescaled returns from different assets. Typical estimates are 
between 0.25 and 0.5 for pairs of metals. 


Chapter Z . 


Forecasting Trends in Prices 


Detailed models for trends in prices are now conjectured and used to 
forecast returns. Slight improvements upon random walk forecasts are 
shown theoretically and empirically. Further theoretical results show that 
trend models imply traders might be able to use the forecasts advantage- 
ously. 


7.1 INTRODUCTION 


If there are trends in commodity and currency returns it will be difficult to 
predict them. No forecast of the next return can be substantially more 
accurate than a forecast based upon a random walk model. We have to 
settle for very slight improvements in forecast accuracy at best and perhaps 
none at all. 

Price-trend models offer the best chance to find improvements upon 
random walk forecasts. Specifications of trend models must to some 
degree be pragmatic, conjectural, and simplistic. Two specifications are 
described in Section 7.2 and assessed by considering the empirical accu- 
racy of the forecasts they suggest. Model parameters are estimated in 
Section 7.3 and used to show why rescaled returns can be more autocorre- 
lated than returns in Section 7.4. Theoretical and empirical forecasting 
results are presented in Sections 7.5 and 7.6 respectively. Having shown 
that price-trend forecasts can out-perform random walk forecasts, some 
theoretical calculations are given in Section 7.7 showing that the forecasts 
would be valuable if the models are specified perfectly. 


7.2 PRICE-TREND MODELS 


Trends will only occur if some information is reflected in several consecu- 
tive returns. We assume each item of information is reflected either quickly 
or slowly. Only quickly reflected information is fully incorporated into 
prices on the same day as it first becomes known by market agents. 
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Assuming returns X, have stationary mean и, we can write 
X, = + (ш — и) + e, (7.2.1) 


with e, being the response to quickly reflected information and и, — the 
response to slowly reflected information. By definition the e, have zero 
mean and are uncorrelated whilst the ш, have mean џи and are auto- 
correlated. Also м, and e, аге uncorrelated, but not necessarily indepen- 
dent, for all s and t. Thus var (X) = var (uj) + var (ej) and, for stationary 
processes, А = var (ш)/маг (X) is a measure of the proportion of slowly 
reflected information. 

The trend idea requires slowly reflected information to influence several 
returns in the same way. Suppose a particular news item is first available on 
day t and, when it is properly understood by everyone, changes the price 
logarithm by а». If the news is slowly reflected it contributes to 4+, — y for 
several т > 0, otherwise it contributes to e,. Supposing a slow interpreta- 
tion, the trend idea suggests a contribution а, о; to H+, — м with the а, 
non-negative and summing to one. These a, might be random variables but 
will be supposed to be deterministic to simplify the discussion. As time 
goes on, more and more people properly understand the information, 
therefore monotonically decreasing =, are appropriate. The simplest 
model for the a, then supposes a,,,/a, = p for all 7 > 0 with 1 > р > 0. 
Then а, = (1 — p)p' and the response aoo, on the first day is a fraction 
1 — p of the total response. Let m = 1/(1 — p). Then the total response is 
equal to m times the first day's response and m will be called the mean 
trend duration. 

Applying all the preceding assumptions and, furthermore, assuming 
stationarity, { ш, ) is ап AR(1) process having autocorrelations p”: 


ш — ш = pua — ш) + 6. (7.2.2) 


The residual 2, represents the effect of all the slowly reflected news first 
available on day t. There might be no such news and then 4, is zero or there 
might be several items with 2, = ag È оң, using an obvious notation. 
Assuming (ej is stationary it then follows that the returns have auto- 
correlations 


р.х = Ap’, (7.2.3) 


as in the price-trend hypothesis, equation (6.5.2). Other trend processes 
have the same autocorrelations (Taylor, 1980, Section 2). 

So far we have sketched a model for the trend component {m} but more 
details are needed to specify a complete model for returns (X). In 
particular, the consequences of changing conditional variances need to be 
specified. Two possibilities will be described and compared throughout 
this chapter. 
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A non-linear trend model 


Conditional standard deviations V, were linked with the general level of 
market activity in Chapter3. This market level will be related to the quantity 
of recent information and other variables. As the amount of new informa- 
tion per unit time increases, the amount reflected slowly may also increase. 
Assuming the proportion reflected slowly is constant across time leads to a 
product process, as in Taylor (1980). This is defined by (7.2.1) and 


B — k = МТ, е = Ve. (7.2.4) 


Then defining U, to be T, + е, we obtain the product process representa- 
tion X, = u + V,U,. The processes {V,}, (T,), and {є,} are assumed to be 
stochastically independent, {є,} is zero-mean strict white noise, E[7,] = 0, 
var (&) = 1 — A and var (T,) = var (u)/var (X) = A. This model will be 
called a non-linear trend model because the process {uw} is non-linear. 

Now consider stationary models with the standardized trend component 
(T,) having autocorrelations p”. Then the standardized returns process 
(U,) has autocorrelations 


рыу = Ар" (7.2.5) 
and, applying (3.7.4), the returns {X;} have 
px = Ap'(E[V,V,,,/E[V:]). (7.2.6) 


The term in curly brackets is less than one (see (3.7.5)). Typically it 
decreases from 0.99 at lag 1 to 0.84 at lag 50 (Taylor, 1982a, р. 216). Note {m} 
is now not AR(1) and therefore р, x is not Ар”. When (7,) is stationary and 
(7.2.5) applies but (V,) and hence (X,) are not necessarily stationary, 
conditional autocorrelations can be defined by 


cor (X,, X. |v,, Мат) = cor (U,, U...) = Ар". (7.2.7) 


To define a complete model for the simulations presented later it will be 
supposed that (log (V,)} and {T,} are Gaussian (hence linear) AR(1) pro- 
cesses with autocorrelations ó” and р” respectively and that the ғ, have 
normal distributions. 

Another way to model a constant proportion of slowly reflected informa- 
tion retains the equation е, = V,e, but models д, using the AR(1) equation 
(7.2.2) with Z, = V,& for strict white noise £. This has some appeal because 
itisalogical development of the ideas used to define (7.2.2). Unfortunately 
it does not permit (X,) to be a product process. 


A linear trend model 


Suppose now that as the amount of new information per unit time changes 
the amount reflected slowly remains constant. This suggests retaining 
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(7.2.1) and e, = Vfe, but now the processes {m}, {V7}, and {e,} аге 
stochastically independent. When ғ? is relatively high there will be more 
noise in the returns but no increase in the size of trends and thus there is 
less dependence among returns. Less autocorrelation should then be 
observed when the variance of returns is relatively high. 

Assuming all processes are stationary let А = var (uj)/var (X,) once more 
and let the variance of the zero-mean, strict white noise process (ғ,) be 
1 — A. Also let о? = var (X). Then given a realization v? of V7 the con- 
ditional variance of X, is Ag? + (1 — А)у??. 

We consider models having autoregressive trends with i.i.d. residuals so 
{ш} is linear and AR(1) with autocorrelations p”. Then 


X = +e, е = Ме, 
and 
ш-к-р(ш-а- ш) + $ (7.2.8) 


and these equations define a /inear trend model. The returns then have 
autocorrelations 


рх = Ар". (7.2.9) 
The correct definition of the standardized return is now 
U, = (X, — м/у 
with 
V, = avt (7.2.10) 
and 
a = V — А + AIE[V*2JF[V* C 2])). (7.2.11) 


These equations ensure that the U, have mean zero and variance one. 
Furthermore, as U, equals (а/а) + (ш, — a) V, the process (U,) is almost 
strict white noise. Although V, and U, are not independent, every pair V, 
and U, are uncorrelated. Given a realized value v,, the conditional variance 
of X, is Aa? + (1 — A)v2/a and will be approximately v? since А must be 
small and hence а = 1. It can easily be shown that the standardized returns 
have autocorrelations 


pu = АРЧЕПУДЕГУТ Vr a). 


The term іп curly brackets can be far more than one. If {log (V,)} is 
Gaussian and highly autocorrelated, with variance 8?, then 


Pu = Аре“. (7.2.12) 


As estimates ñ are often about 0.5 the ratio р, џ/р, x could exceed 2.7. 
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The comparison of (7.2.9) with (7.2.12) shows rescaled returns will 
display more autocorrelation than returns for a linear trend model. Such 
models might therefore be able to explain the empirical observation 
T£ > Tx, explored further in Section 7.4. 

A complete model for simulations is defined by supposing (log (V,)) and 
{ш} are Gaussian, AR(1) processes with autocorrelations ф "апа рапа that 
the z, have normal distributions. Approximation (7.2.12) is then very good 
for low A and high ф, the exact result being 


Pry = Арте 9 (1 — A + де"). 


7.3 ESTIMATING THE TREND PARAMETERS 


Methods 


Estimates of the two trend parameters, A and p, are needed to obtain 
optimal linear forecasts. An ARMA(1, 1) process has autocorrelations Ap’ if 
its parameters are defined appropriately. These parameters, p and q, are 
convenient for finding the optimal linear forecasts for trend models. If (£j 
is white noise and {P,} is defined by P, — pP,., = & — 46-1 then its 
autocorrelations are Ap’ with A = (p — q)(1 — pq)/{p(1 — 2pq + q’)}, 
see equation (1.9.13). Given A and p, qis the solution of 


q? – q{1 + (1 — 2A)p?}/{(1 — Ap) + 120 (7.3.1) 


andq < 1. 

The parameters pand q (and hence A) could be estimated by maximizing 
the likelihood of the data if the returns process was linear. This maximiza- 
tion is impossible for practical non-linear processes. Thus estimates have 
been obtained by matching theoretical and observed autocorrelations as 
in Section 3.9. We consider the following functions, defined for K auto- 
correlations: 


K 
F(A, p) = n > (ri, — Ар")? (7.3.2) 


т=1 


апа 
K 
F,(A, p) = (n — 20) > (fy Ар”). (7.3.3) 
т=1 


Autocorrelations г, , апа г, are defined by (2.9.1) and (6.8.3) respectively 
with n the number of returns used to calculate r, ,; r;, is г, , multiplied by 
n/(n — kr) to reduce bias and r; , has asimilar definition with k being either 
1 for a spot series or the number of contracts for a future series. | have used 
K = 50 for spot series and K = 20, 30, or 50 for futures series based оп 3, 6, 
or 12 months data per contract. 


Forecasting trends in prices 179 


When trend sizes depend on market volatility (a non-linear trend model) 
both г; х and r, are approximately unbiased estimates of p, = Ap’. 
However, г; , has smaller variance and so the numbers А and D minimizing 
F,are appropriate estimates. If, on the other hand, trends are independent 
of volatility (a linear trend model) then г; , is a highly biased estimate of 
р.х = Ap’ whilst r} , is unbiased although often inaccurate. It is probably 
best in these circumstances to seek the numbers Á and D minimizing F. 

To minimize either F, or F, mean trend durations m = 1/(1 — р) = 2,3, 
4, ..., 40 days are considered and for a given m the best unconstrained A 
can be obtained using calculus. For a fixed m and hence p the function Е, is 
minimized by 


K 
Ata = > p.f X p" (7.3.4) 


Likewise, we obtain Až m. Sometimes А* m is negative so it is necessary to 
consider: 


A n = Аў if Afm 0, 
= 0 otherwise. 


Let S; m = Қ(А,,,1- Ут) fori = 1,2, and m = 2,...,40. Minimizing S; m 
over m gives the estimates A; and p; minimizing Е;, and hence rfi; and also d; 
by appropriate substitutions in (7.3.1). A constraint on p, here D x 0.975 
(rh < 40), appears to be necessary as 5; m can continue decreasing by very 
small amounts as m increases. 


Futures 


Table 7.1 presents estimates for the futures series. The estimates A,, т, В, 
апа д; аге given, also Ғ(А;, р;) and the comparable function value for a 
random walk model, i.e. Ғ(0, 0). It can be seen that the majority of the 
estimates of m exceed 10 trading days. Thus, if the statistical methods used 
here are reliable, it takes over a fortnight for some information to be fully 
reflected by prices. These methods were first published in 1980 and no 
serious objections to them are known. 

The estimates of A are, of course, very small. More than half of the 
estimates A, are more than 0.02 and a majority of the А; exceed 0.04. The 
estimated proportion of the returns variance attributable to trends is thus 
10 to 100 times as large as the proportion explainable by the assorted 
equilibrium models discussed in Section 6.11. 

For mostofthe series А? m decreases monotonically as m increases. Only 
corn series have any negative Af m and then only for m x 4. Six of the 
estimates т, in Table 7.1 are 40 days because of the previously mentioned 
constraint. 
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Table7.1 Estimates of price-trend parameters for futures series 


Function 
optimized Á m p 4 F(A, p) F(0,0) 
Series 
Corn (12) [2 0.0092 40 0.975 0.967 1817 186.5 
F, 0.0109 33 0.970 0.961 72.7 78.4 
Corn (6) F, 0.0213 14 0.929 0.911 99.2 109.0 
F, 0.0164 40 0.975 0.962 33.8 47.9 
Corn (3) F. 0.0195 27 0.963 0.947 1043 117.6 
F, 0.0213 40 0.975 0.959 46.9 27.5 
Cocoa (12) F. 0.0225 18 0.944 0.96 1126 124.0 
F, 0.045 17 0.941 0.906 56.9 105.4 
Cocoa (6) [2 0.0203 29 0.965 0.949 88.8 100.9 
F, 0.043 18 0.944 0.909 41.8 88.9 
Cocoa (3) [2 0.0330 19 0.947 0.922 51.8 72.4 
Р, 0.0636 12 0.917 0.870 24.2 74.0 
Coffee (12) [2 0.0191 11 0.909 0.893 84.5 89.2 
F, 0.0770 8 0.875 0.819 60.6 113.0 
Coffee (6) [2 0.026 11 0.909 0.887 86.2 95.2 
F, 0.0982 6 0833 0.763 44.6 98.6 
Coffee (3) F, 0.00171 40 0.975 0.962 102.3 111.3 
F, 0.0798 8 0.875 0.817 38.8 89.8 
Sugar (12) F. 0.029 18 0.944 0.926 76.4 99.4 
F, 0.0367 22 0.955 0.927 66.3 138.5 
Sugar (6) F. 0.0335 15 0.933 0.907 54.4 89.9 
F, 0.0521 16 0.938 0.899 34.3 1253 
Sugar (3) F, 0.0322 16 0.938 0.912 48.0 81.3 
F, 0.0601 14 0.929 0.885 33.5 135.6 
Wool (12) [2 0.0974 2 0.500 0.453 2542 264.5 
F, 0.0436 3 0.667 0.639 72.8 77.8 
£/$ (6) [2 0.0120 40 0.975 0.965 20.9 25.1 
F, 00244 34 0971 0.952 24.6 40.5 
DM/$ (6) [2 0.0089 12 0.917 0.901 39.7 43.4 
F, 0.0480 13 0.923 0.887 45.3 71.2 
SF/$ (6) F. 0.0234 17 0.941 0.922 46.7 54.9 
Р, 0.0206 40 0.975 0.959 45.2 57.7 


Figure 7.1 shows plots of S; „ against m for the longest sugar series. The 
minimum value of S, „ is only 66.3 which is close to the 50 expected when 
the ғ;, have variance 1/(п — 20). The least value of 5; „ is higher (often 
much higher for other series) because the r; , have variances greater than 
1/п. Figure 7.2 shows plots of A, „ against m for the same series. 

Comparing the estimates A, with the estimates A, it is clear that for most 
series Á, is much larger than Á,. When m is the average of rfi, and rf», the 
ratio Á, „/А\ m is approximately 1 for corn, 1.7 for sugar, 2 for cocoa and the 
currencies, and 2.5 for coffee. It appears from these figures that a linear 
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Figure7.1 Plots of 5; „ against the mean trend duration m for sugar futures returns 
(i = 1, dashed curve) and rescaled returns (i = 2, solid curve) 


trend model is a better model than a non-linear trend model but something 
'in between' may be even better. 

The median m, and the median Mm are both about 17 days with the 
median values of A, |; and A, |; being 0.022 and 0.044 respectively. A typical 
futures standard deviation is oy = 0.02. Then, using the median Ау, the 
trend component has standard deviation equal to о, = 0.020.022 = 
0.0030. Furthermore, suppose the trend м, equals its standard deviation о„. 
The expected return over day t is then 0.3 per cent and the partially 
interpreted information is eventually expected to change the price by 
about 17 times 0.3 or 5 per cent. 


Spots 


Table 7.2 presents estimates for the six metals series and the spot currency 
series. All the estimates Á; are virtually zero and the estimates À, for the 
metals are smaller than for the futures. The negligible Á, are probably the 
result of negative first-lag coefficients r , (Table 2.9), indicating a reversal 
effect which obscures any price-trend behaviour. 
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Figure 7.2 Plots of the estimates А, „ against m for sugar futures returns (i = 1, 
dashed curve) and rescaled returns (i = 2, solid curve) 


Table7.2 Estimates of price-trend parameters for spot series 


Function 
optimized Á rh ñ ó F(A, D) F(, 0) 
Series 
Gold Fi 0.0053 20 0.950 0.945 93.1 93.6 
F, 0.0377 19 0.947 0.919 66.1 90.1 
Silver Е, 0.0047 40 0.975 0.971 173.3 173.8 
Р, 0.0190 40 0.975 0.961 64.2 71.1 
Соррег F, 0.0080 31 0.968 0.961 109.1 112.8 
F, 0.0220 20 0.950 0.932 54.3 72.0 
Lead F, 0.0014 40 0.975 0.974 64.2 64.3 
F, 0.0233 40 0.975 0.958 39.2 68.0 
Tin Fi 0.0017 40 0.975 0.973 85.6 85.7 
F, 0.0123 40 0.975 0.965 41.1 49.2 
Zinc Fi 0.0028 40 0.975 0.972 149.3 149.7 
F, 0.0148 40 0.975 0.963 39.6 51.3 
£/$ F, 0.0093 40 0.975 0.967 80.0 83.1 


Р, 0.0460 40 0.975 0.946 54.5 128.6 
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Accuracy 


The estimates Á, D, and ó are far from accurate. Their standard errors are 
considerable. Results for maximum likelihood estimates of p and q are 
given by Box and Jenkins (1976) for linear processes but cannot be applied 
to our estimates. The standard error (s.e.) of Á; presumably exceeds that of 
А? m when т = гї. Thus if the autocorrelations have variances 1/n we 


expect from (7.3.4) that 
s.e.(Á) > ((1— pnp’. (7.3.5) 


For example, n = 2000 and р = 0.95 gives s.e.(A) equal to at least 0.007. 
More information about accuracy is desirable and is now obtained using 
simulations. 


7.4 SOME RESULTS FROM SIMULATIONS 


Both types of trend model have been simulated in order to obtain insights 
into the accuracy of the parameter estimates and also to find out if the 
models can explain the empirical observation ТУ > Ту noted in Section 
6.9. Sets of forty series were constructed by using the models described in 
Section 7.2. Returns having a linear trend component (based upon equa- 
tions (7.2.8)) were used for sets 1 and 2 with a non-linear component used 
for sets 3 and 4 (based on (7.2.4)). All simulations had p — 0.95, so m — 20, 
and lognormal terms V, with В = 0.5 and ф = 0.985. Set 1 had A = 0.02 and 
then the theoretical first lag autocorrelation is 0.019 for returns and 0.050 
for standardized returns; set 2 had A — 0.01; set 3 had A — 0.04 giving a 
first-lag theoretical autocorrelation of 0.038 for both returns and rescaled 
returns; set 4 had A — 0.02 and different random number sequences were 
used for each set. All the autocorrelation coefficients were calculated from 
2000 observations, as in Section 6.11. 


Estimates 


Table 7.3 summarizes the estimates А; and p; obtained by minimizing 
F,(i = 1, 2) and it also summarizes the coefficients гі, and ry. It can be 
seen that the estimates ñ; are considerably biased. All the averages of forty 
estimates D; are less than 0.9 whereas pis 0.95. This downwards bias causes 
estimates A; to have an upwards bias. For models containing linear trend 
components, д, has less bias than D; and, as expected, the averages of 
estimates А, and coefficients г, , are more than the averages of A, and r x. 
For non-linear trend components the г), have less bias and smaller 
standard deviations than the г), and the estimates А, and р, are preferred. 
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Table 7.3 Summary of simulation estimates for two price-trend models 


Lineartrend component |. Non-lineartrend component 


1 2 3 4 

Simulation parameters 

A 0.02 0.01 0.04 0.02 

p 0.95 0.95 0.95 0.95 

B 0.5 0.5 0.5 0.5 

$ 0.985 0.985 0.985 0.985 
Coefficients lag 1 

Series x, mean 0.0111 — 0.0038 0.0274 0.0076 

Series y, mean 0.0364 0.0190 0.0366 0.0177 

Series x, st. dev. 0.0357 0.0312 0.0409 0.0481 

Series y, st. dev. 0.0236 0.0232 0.0209 0.0292 
Mean р; 0.816 0.573 0.895 0.716 
Estimates m, 

Number = 1 1 9 0 5 

Number x 4 11 23 3 17 

Number x 10 23 27 14 24 

Number x 20 29 30 32 28 

Number « 40 36 34 37 34 
Mean D; 0.897 0.810 0.892 0.737 
Estimates rh; 

Number = 1 0 1 0 4 

Number x 4 3 12 6 19 

Number x 10 18 23 17 24 

Number x 20 32 29 30 30 

Number « 40 36 37 37 32 
Estimates А, 

Mean 0.0388 0.0227 0.0453 0.0409 

Standard deviation 0.0334 0.0301 0.0328 0.0487 
Estimates А, 

Mean 0.0483 0.0387 0.0465 0.0400 

Standard deviation 0.0254 0.0339 0.0277 0.0422 


All the autocorrelation coefficients appear to be downwards biased. For 
these coefficients n = 2000 and the estimated bias over lags 1 to 30 is – 12/п 
for sets 1, 3 and —6/n for sets 2, 4. 

It is clear from Table 7.3 that estimates Â; and f; are inaccurate even when 
there are 2000 observations. A confidence region might be defined by all 
numbers A, and py for which Е; (Ap, Po) is less than A (Á;, р) + A for some A 
and í = 1 or 2. The simulations show that A = 6 will not give a 95 per cent 
confidence region as was incorrectly claimed in Taylor (1982a, p. 222). To 
Obtain a 95 per cent region, À must be at least 10. 
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Table 7.4 Summary of trend test statistics for simulated trend models 


Model 
foru, Set T: T: T: — T 
Mean SD Sig.? Mean SD Sig. Mean SD Positive? 
Linear 2.26 1.60 24 4.24 1.84 38 1.98 1.08 38 


1 

2 0.59 1.15 9 2.13 1.15 24 1.54 1.13 38 
Not 3 3.90 2.01 34 3.90 1.40 39 0.00 1.52 22 
linear 4 1.90 2.05 21 2.001 1.56 25 0.11 1.12 24 


*Number of statistics significant at the 5 per cent level, i.e. exceeding 1.65. 
b Number of series having T* > Т}. 


A puzzle solved 


As models containing linear trends have more autocorrelation among their 
rescaled returns y, than their returns x,, we expect ТУ > Т; for most 
simulated series this being the puzzling phenomenon observed for real 
series in Section 6.9. Table 7.4 presents the average values and standard 
deviations for TZ, Ту, and the differences T% — Ту, for each set of forty 
simulated series. The table also shows the number of series in each set 
having Ту > Т; and the numbers of test statistics significant at the 5 per 
cent level. 

The differences 7; — Т; for sets 1 and 2 show that a linear trend 
component can explain why these differences are positive for real series. 
Thirty-eight of the forty differences are positive for each of sets 1and2. The 
average difference is 2.0 for set 1 compared with 3.1 for the six metals series 
and 2.5 for the sixteen futures series. The average difference for simulated 
series will be proportional to Vn and the simulated series are shorter than 
the real ones. Thus the results for set 1 satisfactorily explain the values of 
differences Ту — Т; observed for commodity and currency series. Notice 
that because Ту usually exceeds Т; the former statistic is more powerful for 
random walk tests. Table 7.4 shows the extra power can be substantial. The 
average difference for the stock series is 0.7 and cannot be explained 
comfortably by a trend model having a very small p. 

The results for sets 3 and 4 show that trend models having the same 
autocorrelation between returns for all levels of market volatility cannot 
explain the observed positive differences. Satisfactory trend models must 
have decreasing dependence among returns as the conditional standard 
deviation of returns increases. 


7.5 FORECASTING RETURNS: THEORETICAL RESULTS 


It is assumed in this section that observed returns {x,} are generated Бу а 
stationary stochastic process (X,) having mean y, variance o°, and auto- 
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correlations Ар”. We consider linear forecasts of X,,, made at time t and, 
for each h, seek the constants a; p, minimizing the mean square error (MSE) 
which is 


= 2 

el en -м-Уа,мХ;- и) |. (7.5.1) 
i=0 

The best constants a;, for a given horizon h define the optimal linear 

forecast and it will be denoted F, p. 

We know that realistic processes (X,) are non-linear and it is then 
probable that some non-linear forecast ҒҒ, has a smaller MSE than F, ,. It 
is very difficult to find the best non-linear forecast and it would probably 
not be much more accurate than F,,. Consequently, we only consider 
linear forecasts. 

As the MSE for a linear forecast is a function of the а; р and the variance 
and autocorrelations of the X, it can be assumed that (X,) is described by 
the АКМА(1, 1) model 


X — k р(Х 1 H) = & — 461 (7.5.2) 


with (&) zero-mean white noise and q defined by equation (7.3.1). Alterna- 
tively, we may argue that there is an infinite-order, moving-average repre- 
sentation X, = и + > bj& ; (Section 1.10) and this can be simplified to 
obtain (7.5.2). 


The next return 


Making use of the methods presented in Section 1.9 it can be seen from 
(1.9.11) that 


Xai - b= ён + (р - Ф У q'(X,.; — м 
і-0 
and therefore the optimal linear forecast is 
Ei—-unt(p-qNEqX.;-u) (7.5.3) 
і-0 


= + (p — q)(X, — ш) + qa - М. (7.5.4) 


Thus an observed forecast f, , can be calculated using the latest return x, 
the previous forecast f,—1 1, and estimates of и, p, and q. The coefficients in 
(7.5.3) have total (p — q) > q! = (p — qY(1 — q) which can be much less 
than 1. 

Now X,,, equals F, + &+ and thus the MSE of F, ; equals the variance of 
& 41. Also the forecast Е, and the forecast error &,4 are uncorrelated (cf. 
Granger and Newbold, 1977, pp. 122-3). From (1.9.12) it follows that 
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var (£44) = MSE(F, 1) = (1 — р2)02/(1 — 2pq + q?) (7.5.5) 
and thus 


var (Fa) = (p — qYae?/ – 2pq + 9?) = Apip — q)o?/(1 — pq). 
(7.5.6) 


Compared with the random walk forecast for X,,,, which is simply и and 
has MSE equal to o2, the proportional reduction in MSE obtained by using 
the optimal linear forecast is 


TE o? — MSE(F 1) _ var fa = АР(Р- 9 (7.5.7) 


о о 1 — pq 
Supposing (X,) is defined by one of the trend models described in Section 
7.2, perfect knowledge of the future trend ш, would make possible the 
maximum, feasible, proportional reduction in MSE, namely A. Thus (7.5.7) 
shows linear forecasts can only obtain the proportion p(p — 9)/(1 — pq) ої 
the maximum reduction. 

To illustrate the small magnitudes of the numbers described by these 
formulae suppose A = 0.03 and p = 0.95 so that q = 0.9267. Then the 
forecast weights have total (p — q)/(1 — q) = 0.32, the percentage reduc- 
tion in MSE using F, is 100Ap(p — q)/(1 — pq) = 0.56 per cent and this is а 
proportion p(p — q)/(1 — pq) = 0.18 of the best that could be achieved 
using superior and perfect trend forecasts. 


More distant returns 


For horizons h > 1, equation (1.9.16) shows that the optimal linear forecast 
Е, , is the following function of u, p, and Fi: 


Еһ = u + p" (F4-— и). (7.5.8) 


Sums of future returns 


Trading decisions require forecasts of the total return from day t to day 
t+ h inclusive. Let Sa = X1 + X5 + +++ + Xun be this total. Straight- 
forward algebra shows that the optimal linear forecast of S, , is simply 


Gum Һа + һә + + Ra = һи + 0 — puo ad — p). 
(7.5.9) 


This forecast has MSE equal to Е((5,, — G, ,)^], which is less than the MSE 
of the random walk forecast (hw) which is var (5,,). It follows that the 
proportional reduction in MSE obtained by using the optimal linear fore- 
cast is 
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_ var (Sh) = MSE(G, һ) — Var (Grp) 


: 7.5.1 
Wp var (S, р) var (5, >) ( 0) 
Егот (7.5.6) апа (7.5.9), 
= pn^ rJ 2 
var (G4) = (=) Ap(p — Фо” (7.5.11) 
1-p 1 — pq 


anditis easy to show that 
var (5,,) = o2[h + 2Ap(h — (1 — p^ya — p)y( — p), (7.5.12) 


and hence W, can be calculated. 

For example, if A = 0.03 and p = 0.95 then W, = 0.56 per cent, 
W, = 2.05 per cent, W = 2.92 per cent, И, = 3.25 per cent, and the 
maximum W, is Wis = 3.26 per cent. For these trend parameters the ratio 
var (5,,)/ (һо?) is 1.11, 1.23, and 1.41 for h = 5, 10, and 20 with limit 
1 + 2Ap/(1 — p) = 2.14as h > o. 


7.6 EMPIRICAL FORECASTING RESULTS 


We now compare price-trend forecasts with random walk forecasts over а 
selection of forecast horizons for commodity and currency series. Each 
series has been split as in Section 4.3. The first two-thirds of a series is used 
to estimate the parameters appearing in the forecasting equations, then 
forecasting accuracy is compared on the final third of the series. The 
notation fi will refer to forecast number j of the return x,,, made at time t 
whilst g/ describes the corresponding forecast of the total return X1 + 
X2 + +++ + Xp Over the next h days. Of course fÀ equals 211. 


Benchmark forecasts 


Forecasts based upon trend models must be compared with some bench- 
mark forecast. Random walk forecasts when expected returns are zero 
define our first set of forecasts by 


gi = 0. (7.6.1) 


These forecasts are logical for futures series when it is believed there is no 
risk premium. The obvious random walk forecast when expected returns 
are believed to be non-zero utilizes the latest sample mean. Thus our 
second set of forecasts are 


t 
g) = h X xit. (7.6.2) 
s=1 
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Price-trend forecasts 


We consider three forecasts derived from price-trend models and esti- 
mates of the model parameters. Initially we suppose the expected return is 
zero, a reasonable assumption when studying futures prices. The par- 
ameters p and q have been estimated from returns x, and rescaled returns 
y, using the methods given in Section 7.3 applied to two-thirds of each 
series, say times t = 1 to n, inclusive. Estimates ó and д calculated from x,, 
1 < t <€ n, define the third set of forecasts by adapting (7.5.4) and (7.5.9) 
to give 


fei = (p — d» + GF 1 
and 


gË) = a — дт — р). (7.6.3) 


The fourth set of forecasts, g) , are defined їп a similar way to the third 


set by using estimates ó and д calculated from y, 1 € t € m. These 
estimates are probably biased because the results presented in Section 7.3 
show the y, are more autocorrelated than the x,. It may therefore be better 
to recognize that ó and д estimated from various y, ought to be used to 
forecast later y,. Linking forecasts of rescaled returns, returns, and con- 
ditional standard deviations by the equation 


Ya = fi 1/ Vis (7.6.4) 


it then follows (Taylor, 1980, p. 357) that an appropriate forecast of x;,, is 
defined by 


ffi = (aD — @% + GF a}. (7.6.5) 
Forecasting all future conditional standard deviations by %; it can be 


deduced that 
gi» = (1 — BFP - p). 


Summary statistics 


Given forecasts 21), made for the final third of a series, say for n, < t 
= n — h, empirical mean square errors can be calculated as 
n-h 
MSE(h, D= Y Onate xus — gin — nm — h). 


t=n,+1 


(7.6.6) 


Percentage reductions in MSE relative to the benchmark forecast k are then 
defined by 
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PMSE(h, j, k) = 100(MSE(h, k) — MSE(h, j))/MSE(h, k) (7.6.7) 


with k chosen to be 1 or 2. 

The magnitude of forecast errors depends upon the conditional standard 
deviation of the returns. Thus the MSE and PMSE statistics tell us more 
about forecasting accuracy when the conditional standard deviation is high 
than about accuracy averaged over all variance possibilities. It is therefore 
relevant to consider scaled forecast errors also, defined as forecast errors 
divided by an appropriate estimated standard deviation. These scaled 
errors define the weighted MSE and PMSE by 


WMSE(h, j) = ——-—— x к= toot хаһ = gih) 


n= m= ha Viel 


(7.6.8) 
and 


PWMSE(, j, k) = 100(WMSE(h, k) — WMSE(h, у) /WMSE(h, К) 
(7.6.9) 


with б, essentially defined as before, in fact (4.5.1) or (5.8.3) with x set 
equal to zero. It should be noted that the WMSE criterion for h = 1 can be 
written in terms of forecasts y(/, of ут = Х+1/й +1 as 


У (ин = ЎИЗ) (п — n, 1). 


Thus WMSE measures forecasting accuracy applied to rescaled returns. 


Futures 


Results are tabulated for eight futures series. Series using twelve months 
data per contract were selected for corn, cocoa, coffee, and sugar, to 
minimize the number of initial values required for the forecasting equa- 
tions. The other four series are for wool, sterling, deutschmarks, and Swiss 
francs. When a contract contributed returns from time t, + 20 until time t; 
to a series, with further returns on this contract available from t, to t, + 19, 
then | set ЕЛ = Ofort = t; — 1and all j and the value of v,, for t = t, was 
set at 1.253 times the average of |х, | over times t; to t; + 19. Forecasts fin 
were then calculated for times t, to t; — h and included in the calculation 
Of MSE etc. for times t, + 19 to t, — h inclusive. (For h > 1 no forecasts 
were made from t, — h + 1 to t; — 1; (7.6.6) and (7.6.8) were revised 
appropriately.) 

Comparing the two benchmark forecasts showed it is better to assume a 
zero expected return than to predict returns using past average values. 
Consequently k is 1 when percentage reductions in MSE relative to a 
benchmark forecast are calculated. Table 7.5 presents these percentage 
reductions, PMSE(h, j, 1) and PWMSE(h, j, 1), for h = 1, 5, 20апау = 3,4, 
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Table 7.5 Percentage reductions in mean square error 


PWMSE(h, j, 1)? PMSEth, j, 1)? 
Forecast j=3 4 5 j=3 4 5 
h=1 
Corn (12) 0.08 0.09 0.12 0.12 0.14 0.15 
Cocoa (12) 0.50 0.99 0.98 0.45 0.78 0.74 
Coffee (12) 0.30 022  -026  -025 -137  -371 
Sugar (12) 0.29 0.25 0.10 | —-022  -0.9 -0.69 
Wool (12) 0.04 | —024 -0.44 0.28 | —0.67 -1.08 
£ (6) 0.28 0.32 0.32 0.13 0.00 | -0.07 
DM (6) 0.64 0.97 1.04 0.14 0.08 | —0.04 
SF (6) —1.07 0.33 0.19  -181  -0.19 -0.48 
h-5 
Corn (12) 0.49 0.61 0.83 0.89 1.12 1.26 
Cocoa (12) 1.90 3.96 4.05 2.23 4.53 4.88 
Coffee (12) 0.76 032  —107  -024 -178 -920 
Sugar (12) 2.12 2.12 135  -0.51  -135 —3.18 
Wool (12) -0.00  -1.14 -207  -0.60 4-270  -441 
£ (6) 0.72 0.65 0.64 0.33 -029 -0.95 
DM (6) 1.64 2.49 2.93 0.57 0.54 0.57 
SF(6) 0.38 1.25 0.62 -053 -043 -1.54 
h = 20 
Corn (12) 0.58 0.63 1.26 0.49 0.29 1.33 
Cocoa (12) 0.96 1.50 1.38 2.13 4.42 5.23 
Coffee (12) 0.65 0.48 | —0.00  -406  -543 -14.2 
Sugar (12) 4.16 4.54 3.17 139 -0.51  -432 
Wool (12) —0.28 | —5.30  —827 —024 —6.18 -12.06 
£ (6) 0.80 —0.01 0.42 2.81 1.99 2.67 
DM (6) 1.73 2.63 2.84 1.25 1.84 1.39 


SF (6) 0.81 3.24 0.67 0.87 1.69 —2.56 


“Weighted MSE reduction, defined by equation (7.6.9). 

^Unweighted MSE reduction, defined by (7.6.7). 

*Number of forecasts evaluated for h — 1 were: corn 1248, cocoa 1005, coffee 1006, sugar 
1781, wool 1250, £, DM, SF all 754. 


5. The following average percentage reductions in MSE are obtained by the 
three price-trend forecasts. 


Average PMSE Average PWMSE 
j= 3 4 5 ј = З 4 5 
һ = 1 — 0.14 —0.20 —0.59 0.18 0.28 0.26 
5 0.25 —0.04 —1.58 0.99 1.28 0.91 


20 0.58 —0.22 —2.82 1.21 0.96 0.18 
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Every PWMSE average 15 greater than the corresponding PMSE average. 
This shows that it is harder to achieve any reduction in MSE when the 
conditional variance of returns is relatively high. This conclusion is in line 
with the deduction in Section 7.3 that a linear trend model is better than the 
non-linear trend model there considered. It would appear that as the 
variance of returns decreases the magnitude of trends does not and thus 
forecasting the trend component becomes easier. 

Looking at the PWMSE averages it is seen that all are positive although all 
are, of course, very small. This strongly suggests that it is possible to find 
forecasts marginally better than random walk forecasts, especially when 
the variance of returns is not substantial. The third and fourth forecasts 
have comparable accuracy and are better than the fifth set. The average 
reductions in MSE are, as predicted, greater for h > 1 than for h = 1. 

The best results were obtained for the cocoa, sugar, and deutschmark 
series using the PWMSE criterion. It is difficult to imagine how the more 
successful series could have been identified before assessing the forecasts. 
Certainly the reductions in MSE expected after studying the first two-thirds 
of the series were a poor guide to future forecasting accuracy. For example, 
the anticipated PWMSE figures for sugar were 1.15 per cent for h = 1, 4.26 
per cent (h = 5), and 7.10 per cent (h = 20), for 6 and д from rescaled 
returns, but the actual PWMSE were only 0.25, 2.12, and 4.54 per cent for 
the fourth forecast. 

The PMSE averages are mainly negative and confirm the extra difficulty 
forecasting returns when their variance is high. The coffee and wool results 
are particularly uninspiring and the poor Swiss franc result for h = 1is due 
to a high estimate Á with D = 0.5. 


Spots 


Results are given for the spot currency series and six spot metal series 
(gold, silver, copper, lead, tin, and zinc). Their returns ought to have 
positive means and indeed the latest sample mean is a marginally better 
predictor than an assumed zero mean. Thus k is 2 in calculations of PMSE 
and PWMSE. Also the price forecasts are marginally improved by replacing 
u by the latest mean fll) in equations (7.5.4) and (7.5.9). With these 
modifications the average percentage reductions in MSE for price-trend 
forecasts are as follows. 


Average PMSE Average PWMSE 
j=3 4 5 j=3 4 5 
h=1 —0.13 —0.50 —1.00 0.15 0.28 0.02 
-0.13 —1.56 —3.62 0.73 1.03 0.30 


20 0.45 —1.80 —6.14 0.91 1.28 —0.16 
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The spot averages confirm the conclusions deduced from the futures 
averages. It is possible to achieve slight reductions in MSE but only when 
the variance of returns is relatively low. Again the third and fourth forecasts 
are superior to the fifth set. All but one PWMSE spot average is positive and 
they are similar to the futures averages. On the other hand, all but one 
PMSE spot average is negative and they are more negative than the futures 
averages. The best results occurred for the fourth forecast with h = 20, 
particularly for silver (3.1 per cent reduction in WMSE), lead (4.3 per cent), 
and tin (2.0 per cent). 


7.7 FURTHER FORECASTING THEORY 


It appears that price-trend forecasts can be slightly more accurate than 
random walk forecasts, but are they of practical value? Some theoretical 
results are now given showing that price-trend forecasts are potentially 
valuable on some days, to be followed in Chapter 8 by the profitable results 
of appropriate trading rules. We continue to assume that returns are 
generated by a stationary process, having mean u, variance о? and auto- 
correlations Ар”. 

The correlation between the forecast F, апа tomorrow's unobservable 
trend component w+ will be denoted by A. Using the properties of F, , 
given in Section 7.5, 


var (F, 1) = cov (Fa, Хал) = COV (Fia, ел) 
and thus 


À = Vivar (F, /var (ш.1)) (7.7.1) 
= V(p(p — qYa - ра) (7.7.2) 


making use of (7.5.6). For the typical values A = 0.03, p = 0.95, and 
q = 0.9267 the correlation À is 0.43. Note from (7.7.1) that the standard 
deviations of F, 1, ++, and X,+: are related by 


Of = Ав, = Àox VA. (7.7.3) 


Expected changes in prices 


The optimal linear forecasts are unbiased. Therefore if the forecast f, , is k 
standard deviations from its mean u the expected return over day t + 1 is 
и + Кок and the expected return over h days, i.e. the expected change in 
the price logarithm, is 


hu + ka (1 = р? — p). 
This differs from the random walk prediction hu by 
kAox VA — p^y(1 — p), (7.7.4) 
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applying (7.7.3). Now an investor can wait until K is relatively large, say 
k = 2. Then the expected additional returns when À = 0.03, p = 0.95, and 
ох = 0.02are2.4percentforh = 10,and3.8percentforh = 20,increasing 
to 6.0 per cent for very large h. Such figures suggest that a decision-taker 
who neednotactupon every forecast may be able to use selected forecasts 
advantageously. 


Forecasting the direction of the trend 


It can be important to forecast the direction (up or down) of any trend 
correctly. We estimate the probability of correctly forecasting the direction 
by assuming шлапа f, have a bivariate normal distribution. Then 


Hil fa rr N(f, 1, a- A2)o2). 


Thus, if firstly д = 0 and secondly the forecast f, ; is a positive number Кот, 
the estimated probability that the trend is positive becomes 


Р(М(Әо,, (1 — À2)e2) > 0) = O(kA/V(1 — А?) (7.7.5) 


with Q(.) the cumulative distribution function of the standard normal 
distribution. When A is 0.43 the estimated probability of correctly forecast- 
ing the trend's direction is 0.68 and 0.83 for forecasts о; and 2o, respec- 
tively. These figures suggest that on many days the direction of the trend 
can be predicted correctly. 


Forecasting prices 


Suppose g, n is the optimal linear forecast of log (2,,/2) = Ха + c + 
Xn- Then a simple forecast of Zin is z, exp (6,,). This forecast will be 
biased. An unbiased forecast can be obtained if it is assumed that the 
conditional distribution of 5, р = log (2,;,/2,) given z and h = {X х,_1,...) 
is normal. It then follows that the best forecast of Z}, is 


EIZ, +n | l, z] = z,E[exp (5r | h)] 
= Z, exp (g&n + àlvar (5,,) — var (G,4)] (7.7.5) 


which can be evaluated using (7.5.11) and (7.5.12). 


7.8 SUMMARY 


Price-trend forecasts can provide very small reductions in mean square 
forecast error. Such reductions appear to vary inversely with the appro- 
priate conditional variance of returns. This can be explained by the 
magnitudes of trends being determined independently of the conditional 
variances of the random component of returns. 
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Two ways to estimate the trend parameters have been suggested. 
Although different estimates arise for the same series the forecasting 
accuracies achieved are similar on average. Non-linear forecasts might be 
better than the optimal linear forecasts discussed, especially if the weights 
given to past observations vary inversely with both the conditional vari- 
ances and the time lag. 


Chapter 8 -- 


Evidence Against the Efficiency of Futures Markets 


Price-trend models are now used to construct trading rules. These rules are 
compared with the best investment strategies when investors believe the 
efficient market hypothesis is true. The trading results are shown to give 
better results for all six futures series considered and therefore the 
corresponding markets are unlikely to be efficient. 


8.1 INTRODUCTION 


A market will be said to use information efficiently if there is no way to use 
the information to increase expected wealth by frequent trading. This 
definition is discussed in Section 8.2. We continue to consider only the 
information provided by the present and past prices. An efficient market 
implies all trading rules are worthless, i.e. price information cannot indi- 
cate the best times to buy or sell. 

Many professional traders disbelieve market efficiency. Advertised suc- 
cess stories rarely present a complete and honest description of trading 
results, for reasons left to the reader's imagination. The academic commu- 
nity on the other hand generally favours the efficient market paradigm. 
This is commendable for share markets but much less is known about the 
efficiency of futures markets. We should expect academic publications to 
be biased towards market efficiency. As Granger (1979, p. 104) remarks: ‘If 
such a [foolproof] strategy were found, it would hardly be made public, 
even by an academic'. However, notall academics have sufficient capital to 
run a successful trading system! 

Potentially profitable trading rules are presented in Section 8.7 butbetter 
rules exist. Trading rules having a good chance of making the efficiency of 
futures markets appear dubious are discussed. In this respect, the rules are 
highly successful. The combination of expected return and risk is deliber- 
ately not optimized and other important improvements are known to be 
possible. 
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Profitable trading results are given for cocoa, coffee, sugar, sterling, 
deutschmark, and Swiss franc futures in Section 8.9. The other available 
futures prices (for corn and wool) are relatively out of date. Concentrating 
on the six selected series may be considered a biased decision based upon 
hindsight. This is denied. These series would have appeared to be promis- 
ing ones for trading rule research if only the first half of every series 
available had been subjected to random walk tests and the estimation of 
price-trend parameters. Trading rules rarely do well on share prices, see 
Fama and Blume (1966) and Ball (1978). 

Trading rule research is fraught with methodological problems. Several 
are apparent in earlier studies of futures markets (Section 8.3) and studying 
futures introduces problems measuring returns and risks (Section 8.4). 
Section 8.5 summarizes trading conditions for ordinary investors and 
Section 8.7 lists realistic assumptions for assessing the performance of 
trading rules. 

Theoretical and empirical evidence against efficient pricing is described. 
Little is known about the amount of autocorrelation required to refute the 
efficient market hypothesis. Efficiency is likely if any non-zero autocorrela- 
tions are small and confined to low lags. It is possible for markets to be 
inefficient, however, if small autocorrelations occur at several lags. 
Theoretical arguments are given in Section 8.6. 

Each futures series is divided into two parts for a realistic assessment of 
practical trading rules. The first part (calibration contracts) is used to select 
two trend parameters and two trading constants, a process aided by 
simulating a price-trend model (Section 8.8). The second part (test con- 
tracts) is only used to assess the results of hypothetical yet realistic trading. 
Average net returns are shown in Section 8.9 to exceed the comparable 
averages from efficient market strategies for all six series. The implications 
of these results are discussed in Section 8.10. Figure 8.1 summarizes the 
methodology used to investigate market efficiency. Note the important 
role of price models. 


8.2 THE EFFICIENT MARKET HYPOTHESIS 


We adopt a practical definition of the efficient market hypothesis. The 
hypothesis is said to betrue ifthe risk-adjusted return, net of all costs, from 
the best trading rule is not more than the comparable figure when assets 
are traded infrequently. This is essentially the definition given by Jensen 
(1978). It assumes trading rules buy and sell the same goods fairly often, 
typically more than once a year. In comparison, goods should be traded at 
efficient markets as little as possible to minimize costs. Note that investors 
may never want to trade assets like futures when they believe the markets 
are efficient. 
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Ideas about inefficient pricing 


Parametric trend models for 
returns 


Separate calibration 
contracts from test 
contracts 


Derive appropriate random 
walk tests 


Autocorrelations for calibration 
Contracts show trend models 
plousible 


Investment strategies 
if markets ore 
efficient 


БЖ Mathemoticol results 


obout autocorrelation 
and efficiency 


Estimate trend parameters 


Forecasting equations 
Troding rules 


Select trading rule constants 


by simulating trend models 


Revise trading constants 
using calibration contracts 


Compare trading rules and 
efficient market strategies 
on test contracts 


Figure 8.1 A summary of methodology. Reproduced (with revisions) from Taylor 
(1983) by permission of Basil Blackwell, Oxford 


Trading rules generate more business for brokers than efficient market 
strategies and therefore have higher transaction costs. Thus rejection of 
the random walk hypothesis does not necessarily imply rejection of the 
efficient market hypothesis. In addition to transaction costs there are costs 
due to differing bid and ask prices, the acquisition of information, the 
interpretation of information and perhaps taxes. It costs almost nothing to 
obtain daily closing prices. Developing trading systems is, however, time 
consuming and requires specialist skills. The associated costs are impos- 
sible to measure and will be ignored: only the inventor of a trading system 
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can decide if the development costs are less than any net profits. Daily 
interpretation of information requires time too. Very little time is needed 
to operate the rules described in this chapter. Similarly, negligible com- 
puter resources are required. 

Risk must be considered when trading results are assessed. Ideally, risk 
should be discussed in a portfolio context. Thus alternative investments 
such as shares mustnot be forgotten when futures trading is contemplated. 
Fortunately it is fairly easy to take account of such alternatives because 
returns from futures are almost uncorrelated with share returns (Dusak, 
1973; Bodie and Rosansky, 1980). 

Our definition of the efficient market hypothesis has the advantage of 
being practical at the cost of being person dependent. Resources, oppor- 
tunities, and attitudes towards risk vary between people. In particular, 
ordinary investors face higher trading costs than market members. 


8.3 PROBLEMS RAISED BY PREVIOUS STUDIES 


Few investigations into the performance of trading rules upon futures 
prices have been published and almost none after 1976. This is surprising 
as work by Stevenson and Bear (1970, 1976), Leuthold (1972, 1976), and 
Martell and Philippatos (1974) suggests some US markets are inefficient. 
These studies do not prove inefficiencies. Their evidence, however, does 
raise interesting possibilities deserving further research. The articles cited, 
and others, also draw attention to several important methodological 
issues. 


Filter rules 


The trading rule assessed in most research articles is the filter rule of 
Alexander (1961) or some variation thereof. It depends on a single par- 
ameter g. If futures are bought, called a long position, then they are held 
until the price falls g per cent from a peak; i.e. for a purchase on day | sell 
on the first day j > i for which the price z; is g per cent or more below the 
maximum price between days / and j — 1 inclusive. After cancelling the 
long position, futures are sold short on day j and the short position held 
until the price rises g per cent from a trough; i.e. go long again on the first 
day k > j for which z, is g per cent or more above the minimum price 
between days j and k — 1 inclusive. The range 0.5 < g x 25 has been 
considered. 

Filter rules are intended to let profits accumulate when there is a distinct 
trend, changing from long to short and vice versa when the trend appears 
to have changed direction. An initial position can be taken as soon as the 
price has moved g per cent from its initial level. The rule can be assessed 
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on a series of daily closing prices, though real trades may not be possible 
or practical at exactly the close. Fama and Blume (1966) discuss filter rules 
in more detail. 

Stevenson and Bear (1970) evaluated 12, 3, and 5 per cent filters on daily 
corn and soybeans futures prices between 1957 and 1968. The 5 per cent 
filter rule (their rule 2A) was more profitable than a buy and hold strategy 
after deducting commission costs. Buy and hold is a series of long 
positions, futures being bought on the first day a contract is considered 
and sold on the corresponding final day. Annual rates of return for the filter 
rule and the competing buy and hold strategy were not given. Impressive 
dollar profits were published yet cannot be interpreted easily. 

Leuthold (1972) evaluated six filter sizes for daily live beef futures prices 
from 1965 to 1970. Profits occurred for five filter sizes after deducting 
commission costs. These profits were not compared with buy and hold as 
Leuthold believed this strategy was not the appropriate benchmark. The 3 
per cent filter gave the largest profit. However, profits were highly sensitive 
to the filter size, being negative for the 5 per cent filter and positive again 
for the 10 per cent filter. High rates of return on a margin investment were 
given. They are difficult to interpret. 

Bird (1985) has evaluated filter rules for cash and forward transactions at 
the London Metal Exchange. These forward contracts are similar to futures. 
Risk-adjusted profits may have been possible before July 1977 but not in 
later years. 

Other applications of filters to futures offer less of a challenge to the 
efficient market hypothesis. Martell and Philippatos (1974) claim adaptive 
rules are successful but give insufficient details. Praetz (1975), for Sydney 
wool futures from 1966 to 1972, found no evidence that filter rules were 
superior to buy and hold. 

Published results, particularly those of Stevenson, Bear, and Leuthold, 
cannot refute market efficiency for commodity futures because an 
adequate methodology has yet to be established. Three problems are 
emphasized. 


Benchmarks 


[t is far from obvious what the returns from a filter rule should be compared 
with (Leuthold, 1972; Cargill and Rausser, 1975). The issue of risk premia is 
relevant. Risk-free assets, such as government securities or bank deposits, 
are appropriate for comparisons if it is believed there is never a risk 
premium. On the other hand, buy and hold must be assessed if a positive 
premium is believed to exist. Recall from Section 2.3 that the empirical 
evidence is indecisive but perhaps favours the view that there are premia 
for agricultural futures. Another complication is a bias in filter rules when 
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there is a positive premium, caused by taking short positions against the 
premium (Praetz, 1976a). This bias makes it harder to identify any market 
inefficiencies. 


Significance 


Very little can be said about the distribution of filter returns when the null 
hypothesis of an efficient market is true. Thus the results of Stevenson, 
Bear, and Leuthold cannotbe usedto refute the efficient market hypothesis 
(Praetz, 1976b). For tests of the less general random walk hypothesis, 
Praetz (1979b) and Bird (1985) have derived test statistics based upon filter 
decisions. They assumed normal and identically distributed returns. It is 
unclear how much these strong conditions can be relaxed. Furthermore, 
we must appreciate that the distribution of a filter result is not the same for 
the two hypotheses, random walk and efficient market. Filter results are 
functions of sums of certain returns X, (Praetz, 1979b) and the variances of 
these sums depend on the autocorrelations of the X,, which do not have to 
be zero for an efficient market. 


Optimization 


Any trading result obtained by a retrospective optimization of the filter size 
g must be evaluated cautiously (Praetz, 1976b). Most published studies 
contain a dubious optimization. Traders could not guess the best filter size 
in advance and itis unlikely an optimized filter will be optimal in the future. 
The correct procedure is, of course, to splitthe prices. Then choose gusing 
the first part and evaluate this g upon the remaining prices. This procedure 
requires many years of daily prices. 


8.4 PROBLEMS MEASURING RISK AND RETURN 


Recall that our preferred definition of an efficient market refers to risk- 
adjusted net returns. Ideally this definition should be applied to trading 
results. 


Returns 


Returns from share investments can be measured easily. This is not so for 
commodity futures because of margin trading. A position can be taken іп а 
futures market by depositing 10 per cent or less of the position's value with 
a broker. It is possible to define the return from a trade by identifying the 
margin deposit with the amount invested (Leuthold, 1972). The investment 
is then highly risky. However, losses may well exceed the (initial) deposit 
so definitions involving margin payments are surely inappropriate. 
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It is arguable that a better approach is to consider some sum to be 
invested in a futures fund, then to calculate the fund's return after 
deducting commission costs and including interest earned on money not 
required by brokers. Perhaps interest can also be earned on the margin 
deposits. Trading amounts would have to be chosen to make the chance of 
losing more than the entire fund negligible. 


Risk 


Once we have a way to calculate the return from a trading strategy it is 
necessary to consider adjustments for risk. Returns can often be improved 
by taking greater risks but any higher returns thereby obtained do not 
refute market efficiency. Risk-adjusted returns are desirable. These can be 
found for share investments by using a model that prices risk, e.g. the 
capital asset pricing model mentioned in Section 6.11. The expected 
reward for accepting a share's undiversifiable risk is then proportional to 
the covariance between share return and the return on the market portfolio 
made up of all shares. Applying this idea to filter results may appear tricky 
but a satisfactory solution is given by Ball (1978) for studies involving 
several shares. 

Commodity futures again pose problems. Empirical estimates of correla- 
tions between returns from US futures and US shares indices are very close 
to zero (Dusak, 1973; Bodie and Rosansky, 1980). Perhaps the market 
portfolio should be defined to include more than shares. Futures should 
not be included, as in Carter et a/. (1983), because the aggregate position in 
futures is always zero (Black, 1976). Instead, privately owned agricultural 
and mineral resources may be relevant. Another approach is to consider 
consumption risk in a pricing model (Breeden, 1980). 

Accepting zero covariance between returns from futures and diversified 
share portfolios has important consequences. Investors need not be 
rewarded for risks borne by buying futures, as such risks can be diversified 
away (Black, 1976). Then the expected return from futures is zero and, 
theoretically, no risk adjustments are needed for trading results. 

However, if reality and theory differ futures may still be expected to pay 
a risk premium. It would then be an excellent idea to augment share 
portfolios by including futures. Bodie and Rosansky (1980) have argued 
that combining commodity futures with a diversified share portfolio would 
have decreased the standard deviation of annual returns from 19 to 13 per 
cent without any reduction in the expected return. Trading rules should be 
compared with buy and hold in these circumstances. Risk measurement is 
now difficult as zero covariance and a positive risk premium are contradic- 
tory in the usual asset pricing framework. When the measurement of risk 
reappears in Section 8.6 an intuitive solution will be proposed. 
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Necessary assumptions 


To conclude the comments on problems faced when researching trading 
rules, note that assumptions must be made about prices. Usually it is 
assumed trades can be transacted at published closing prices. This presup- 
poses a highly liquid market. It also supposes each trading decision has no 
effect on subsequent prices. In particular the quantity traded must be 
relatively small and other traders do not react to the trading rule's recom- 
mendations. 


8.5 TRADING CONDITIONS 


Throughout this chapter we consider futures trading from the standpoint 
of an ordinary investor. The investor can only trade by giving instructions 
to a broker. Brokers charge commission for opening and closing a futures 
position (a ‘round trip’). They also require a margin deposit when a trade is 
opened. This deposit is intended to ensure that the investor can pay forany 
losses. Further margin deposits may be demanded if the price moves 
against the investor. UK brokers do not pay interest on margin deposits. US 
investors are more fortunate. They may be able to deposit Treasury bills 
and later keep the interest paid on the bills. 

Commission rates are either fixed by the market's clearing house or 
competitively by brokers. Rates change from time-to-time. Suppose a 
trade is opened at a price z, for T units thus involving goods worth Q = Tz, 
when the trade begins. Then we will assume asingle payment of cQis made 
at time t to a broker covering all trading costs. We make the simplifying 
assumption that c is the same constant for all times considered. We include 
commission and additional costs due to different bid and ask prices in the 
total cost cQ. An appropriate value of c for cocoa, coffee, and sugar futures 
traded in London is between 0.5 and 1.0 per cent. Currency futures traded 
at Chicago have far lower dealing costs; c was about 0.2 per cent for the 
years studied here. 

Margin requirements depend on the broker and the investor's credit- 
worthiness. It will be assumed that dQ is deposited with a broker when a 
trade is opened, for some constant d. Conservative values for d are 10 per 
cent for the London agriculturals and 3 per cent forthe Chicago currencies. 

When the price moves against the investor, the effective deposit 
becomes the sum of all margin deposits minus the current loss on the 
futures. If this balance falls below eQ it will be assumed, from Section 8.7 
onwards, that the balance is restored to dQ by making a further margin 
deposit. Again e is some positive constant and e < d. 

Futures are defined by standard contracts. Thus T, the number of units 
traded, is in practice some multiple of a basic lot size. Recent lot sizes are 
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10 tonnes of cocoa, 5 tonnes of coffee, 50 tonnes of sugar, 25000 pounds 
sterling, 125 000 deutschmarks and 125 000 Swiss francs. The value Qof one 
lot recently ranged from £5000 to £20000 for the agricultural goods and 
from $25 000 to $60 000 for the currencies. 

An astute UK investor need not pay tax on gains from futures trading. 1 
assume the same situation applies in the US and therefore tax will not be 
mentioned again. 


8.6 THEORETICAL ANALYSIS 


Sufficiently large and long-lived trends in prices will make a market 
inefficient. Either returns or rescaled returns have autocorrelation Ap’ at 
lag 7 days when a price-trend model applies. It is of interest to know how 
large A and p must be to create inefficiencies. Theoretical answers can be 
given although several assumptions must be made. Practical answers are 
less definite and are offered in later sections. 

An investor's optimal strategy when a futures market is efficient could be 
do nothing (i.e. forget this market) or buy and hold futures. Even when 
there is no risk premium it could be optimal to buy and hold (we see why 
later). Consequently, a market will be called inefficient in this section if an 
investor can adopt a trading strategy outperforming bank deposits when 
that is the preferred benchmark and can adopt another strategy out- 
performing buy and hold should that be the preference. The strategies and 
their profit potential were first described in Taylor (1983). 


Trading strategies 


Trades will be initiated for certain values of the standardized forecast k, 
defined for a stationary returns process by 


k = (fj, — м/оу. (8.6.1) 


Here оу is the standard deviation of the variable generating the optimal 
linear forecast (fı) of the next return (х,у) with ш the unconditional 
average return. 

Firstly, suppose the preferred benchmark is a risk-free investment, say 
depositing cash in an interest-bearing bank account. Strategy 1 is intended 
to earn a high return when a substantial upward trend is predicted: 


(1a) Invest at a risk-free rate on day 0, 
(1b) Buy on day t > 0 if k exceeds some number ky, 
(1c) Sell on day t + Nand revert to a risk-free investment. 


The trader chooses kç and N. It is not necessary to say how much is bought 
and sold. Strategy 1 is clearly risky. Nevertheless if strategy 1 makes more 
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on average than a comparable risk-free investment then the strategy can be 
viewed as an asset whose inclusion in a diversified portfolio will improve 
the investor's combination of expected return and risk. 

Secondly, assume the benchmark is buy and hold. Strategy 2 abandons 
the long position when a substantial down-trend is predicted: 


(2a) Buy T units of futures on day 0, 

(2b) Sell on day t > 0 if k is less than some number —K, and invest all cash 
returned by the broker at the risk-free rate for N days, 

Qc) Buy the same quantity of goods again on day t + N. 


| assert that strategy 2 is less risky than buying and holding T units of 
futures. The reason is that strategy 2 has an open position in futures less 
often than buy and hold. Thus strategy 2 will be clearly superior to buy and 
hold when the strategy has the higher expected return, for it also has the 
lower risk. 


Assumptions 


Several assumptions must be made to obtain exact results. These are as 
follows: 


(i) There are a few wealthy well-informed investors. Such people know 
all the parameters of the process generating prices, so they know the 
trend parameters A and p. They can trade without altering the path 
prices take. They also have sufficient wealth to engage in maximal 
share market diversification thus ensuring the risks of strategies 1 and 
2 can be ignored. 

(ii) The returns process is Gaussian. Thus all sets of returns have mul- 
tivariate normal distributions. Returns have mean и and variance o. 

(ін) Departures from a benchmark strategy always last N days, N being 
fixed in advance of any trading. 

(iv) Trading costs equal to cQ are paid when a position worth Q is 
opened. 

(у) Only one margin deposit per trade is required: an amount dQ is held 
by a broker who pays no interest upon it. 

(vi) One interest rate applies to all risk-free lending and borrowing, 
whatever the duration of the deposit or loan. Simple interest is 
calculated at the rate r per trading day and paid when the future 
ceases to trade. 

(vii) Each futures contract is traded for one year. 

(viii) Money can be transferred instantaneously between the investor's 
bank account and broker. 


206 Modelling financial time series 


It must be noted that (i) is too optimistic because parameter estimates are 
always inaccurate. However (iii) works against the investor as forecasts 
after departing from the benchmark strategy are ignored. Also (ii) repre- 
sents a considerable simplification. 


Conditions for trading profits 


Firstly, we compare a risk-free investment of capital Con day 0 with strategy 
Т. Buying futures worth Q on day t at the price z, when k > ky leaves 
C — (c + d)Q in the risk-free account. At time t the selling price on day 
t + Nis uncertain and described by the random variable Z,,,,. The wealth 
obtained from strategy 1 on day t + Nis the random variable 


W, = Ctr + (C — (c + ФО) + Nr) + dQ + О2,,- zJ/z. 
(8.6.2) 


Forming expectations at time t using the standardized forecast k, the 
expected wealth at time t -- N exceeds that from risk-free investment if 


E[W,|k] > C(1 + tr + Nr), 
which simplifies to 
(Шу = 2)/2|К] > c + Nr(c + а). (8.6.3) 


This inequality states that strategy 1 is better than a risk-free investment if 
the expected proportional price change exceeds the trading costs and the 
interest forfeited on the deposit and trading costs. 

Secondly, beginning with capital C on day 0, compare buying and 
holding futures worth Q with strategy 2. Let day t be the first after day 0 on 
which k < – о. Then strategy 2 sells on day t and buys the same quantity 
back at time t + Nto give wealth 


W, = (C — (c + d)Q}{1 + (t + N)r) 
+ [dQ + Q(z, — zyyzl1 + Nr] — СО, Zo (8.6.4) 


after completing transactions at time t + N. At the same time buy and hold 
gives wealth 


Wa, = (C — (c + d)Q)(1 + (t + N)r) + dQ + QZan — Zo)/zo. 
(8.6.5) 


Again taking expectations at time t, strategy 2 is better than buy and hold 
up to time t + Nif 


E[W,|k] > ЕІМӘніКІ. (8.6.6) 
Sufficient conditions are, from Taylor (1983), z, > zí and 


(1 + Мағуа + c) > 2,2, | КІ. (8.6.7) 
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When (8.6.7) and hence (8.6.6) are true itis possible that strategy 2 expects 
to have less money earning interest at time t + N than buy and hold. To 
ensure this does not happen it suffices to add the further conditions 
2.150 > (1 — dY(1 — d — c) and 


1 > F[Z.N/z|K]. (8.6.8) 


Suppose К, has been chosen. Then | will say а market is theoretically 
inefficient if (8.6.3) is true for k = kọ and some Nand, furthermore, (8.6.7) 
and (8.6.8) are true for k = —ky and some other value of N. It is necessary 
to choose ky. This number controls the frequency of departures from the 
benchmark strategy. | have let kọ = 1.65 so departures begin on up to 5 per 
cent of days. 

To assess (8.6.3), (8.6.7), and (8.6.8) for given parameters А, p, ш, and с 
we can use equations (7.3.1), (7.5.9), (7.5.11), (7.5.12), 7.7.2), 7.7.3), and 
(7.7.5). It is advisable to include a constraint on N in any evaluation 
program, say N < 100. 


Inefficient regions 


Results depend on the market parameters c, d, and r and the mean и and 
variance o? of returns. Commodities and currencies are discussed separ- 
ately. Firstly, consider examples based upon London agricultural futures 
and, following Taylor (1983), the parameter values 


c= 0.01, а= 0.2, r=0.12/250, д = 0.0005, с = 0.022. 
(8.6.9) 


These figures are compatible with Table 2.4, the discussion in Section 8.5, 
and interest rates in 1982, with a high d chosen to try and ensure assuming 
only one margin deposit is reasonable. The risk-free rate is 12 per cent per 
annum. The expected annual growth in prices is F[Z,o/Z;] — 1. This is 
exp (250u + 1250?) — 1 = 20 per cent fora random walk and is more when 
there are trends. Consequently buy and hold is the harder benchmark to 
beat. 

North-east of the curve 1,1; on Figure 8.2 a market is theoretically 
inefficient. The total autocorrelation over all positive lags does not change 
much оп the boundary curve between theoretically efficient and inefficient 
markets. Denote this total by 


ж E 


Y = У p, = > Ap! = Ар/(1 — p). (8.6.10) 
1 


т=1 т= 
Then a market is theoretically inefficient for the parameters (8.6.9) if 


Y > 0.42 and 0.975 > p > 0.5. (8.6.11) 
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Figure 8.2 A boundary between efficient and inefficient markets (L4 L,) when there 
is a risk premium. Reproduced (with revisions) from Taylor (1983) by permission of 
Basil Blackwell, Oxford 


Halving d or r moves the boundary a negligible distance. Doubling the 
trading cost c to 2 per cent increases the dependence required to exploit 
inefficiencies to Y > 0.62 whenever p > 0.8. 

Of course, it may be argued that there is no risk premium for these 
markets. Reducing ш to zero makes exploiting inefficiencies easier. А 
sufficient condition is then Y > 0.24 whenever p > 0.8. The harder bench- 
mark to beat remains buy and hold because the distribution ofthe year-end 
price is skewed: for a random walk £[Z;s)/Z,] = exp (12507) = 1.062 and 
thus (8.6.3) shows it is easy to improve on a risk-free investment. Letting 


ш = —}0° ensures E[Z;s/Z,] = 1 for a random walk and the risk-free 
benchmark is crucial. Theoretical efficiencies then occur if Y > 0.18 and 
р > 0.8. 


Secondly, consider examples based upon IMM futures and, following 
Taylor (1983) again, the parameter values 


c = 0.002, а= 0.1, r= 0.16/250, џи = 0, с = 0.006. 
(8.6.12) 
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The low standard deviation с causes the risk-free benchmark to be 
favoured to buy and hold. A market is theoretically inefficient for the above 
parameter values if 


Y>0.17 and р> 0.8. (8.6.13) 


Reducing r will reduce the autocorrelation consistent with an efficient 
market. Paying interest on margin deposits essentially makes d zero. Then 
Y > 0.15 and p > 0.8 ensures theoretical inefficiencies. Doubling cto 0.4 
per cent increases the autocorrelation required for profitable trades to 
Y > 0.31 whenever p > 0.8. Finally, very similar results are obtained if 
ш = —%о?. 


Some implications 


The estimates of Aand p presented in Chapter 7 can be used to discuss the 
theoretical efficiency of futures markets. Care is needed because a linear 
process is assumed for returns in this section but a non-linear process in 
Chapter 7. The linear process is intended to be an approximation to the 
true non-linear process operating between times tand t + Ninclusive. The 
autocorrelations of the true process are Ap' for the non-linear trend 
models of Section 7.2 but Ap* multiplied by a constant (determined by the 
conditional variance) for the alternative linear-trend models. It can be 
argued that suitable values of A when assessing the theoretical efficiency of 
real markets аге A, = A, exp (8?) and Á, for the values of a? discussed 
here; values of A, and A, appear in Table 7.1and the ñ values are taken from 
Table 3.3. Using these estimates and appropriate estimates of p we obtain 
the following estimates of the total autocorrelation Y. 


Estimates of Y 


Estimate of A 

Series А А, 

Сосоа (6) 0.69 0.84 
Coffee (6) 0.47 0.49 
Sugar (6) 0.64 0.78 
£/$ (6) 0.62 0.80 
DM!/$ (6) 0.26 0.58 
SF/$ (6) 0.44 0.80 


All these estimates imply a theoretically inefficient market according to 
the criteria (8.6.11) and (8.6.13). Thus very small autocorrelation at several 
lags can make markets inefficient. The six estimates of the coordinates 
(A, p) for the agricultural futures are indicated by crosses on Figure 8.2. 


210 Modelling financial time series 


Now consider typical results when the trading strategies deviate from the 
benchmark strategies. For cocoa and sugar we suppose the mean trend 
duration m is 16 days (so p = 15/16) and A is either 0.036 (from Ag) or 0.05 
(from А,). Consider A = 0.05. The largest difference between the two sides 
of inequality (8.6.7) occurs for k = —1.65 when N = 24. Assume z, = 20. 
Then selling futures at $1000 and buying again 24 days later at $974 (on 
average) makes $26 per unit, to which add $2 interest on the returned 
deposit then deduct $10 trading costs for a net profit of $18. Thus wealth is 
increased by $18 for every $1000 of futures sold, i.e. 1.8 per cent ofthe value 
of the goods sold. This is 19 per cent per annum assuming 250 trading days 
per year. Opting instead for N — 11 will make $13 per $1000 or 29 per cent 
per annum. These figures appear in column 2 of Table 8.1. Results for the 
lower value of A appear in column 1. 

Similar results for hypothetical currency trades are given in the third and 
fourth columns of Table 8.1. The hypothetical gains are smaller than for the 
agricultural goods. 


Table 8.1 Summary of hypothetical trading results 


1 2 3 4 

Cocoa? Cocoa ЕР £ 
Estimate for A 0.036* 0.0507 0.022< 0.036? 
Estimate for m 16 16 20 20 
N maximizing profit/trade 21 24 33* 43* 
Open trade at 1000” 1000” 10005 10005 
Close trade on average at 983 974 1010 1017 
Favourable price move 17 26 10 17 
Extra interest 2^ 2 
Interest lost 2! 3 
Trading costs 10! 10 2 2 
Total gain ($) 9* 184 6 12 
Total gain (%) 0.9 1.8 0.6 1.2 
Gaininannual terms 10% 19% 5% 7% 
N maximizing gain/day 13 11 15 12 
Gain in annual terms 14% 29% 6% 11% 


“Апа sugar. "And DM/$, SF/$. ‘From Â. "From Á,. “Negligible additional benefits from 
higher N. ‘Sell. Buy. ^1000Ndr. '1000N(c + d)r. ‘Maximum expected value. “Assumes 
Z, = 2. 


8.7 REALISTIC STRATEGIES AND ASSUMPTIONS 


The theoretical results show that it may be possible to exploit slight 
dependence between returns when a price-trend model is appropriate. 
Strategies and assumptions are now made as realistic as possible. All the 
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price series are then subdivided. Only the first part of a series is used to 
select trading parameters so that realistic trading results can be obtained 
for the second part of a series. These results will show that several markets 
were probably inefficient during the years studied. The following three 
sections are based upon an earlier publication (Taylor, 1983, pp. 184—93) 
updated to include a further year of prices. 


Strategies 


Trading decisions depend on a standardized forecast calculated by assum- 
ing a non-linear trend model: 


k = IB. (8.7.1) 
б, = Via {Ap(p — qYO — pq). (8.7.2) 


The forecast fi?! is defined by (7.6.5). The estimate ó, of the forecast's 
standard deviation has been derived from (7.7.2) and (7.7.3) with v,,, an 
estimate of the next return's standard deviation (equation (5.8.3) with x 
ignored, y = 0.1 and 1.253 replaced by 1.333). Whenever necessary, A, p, 
and qin (8.7.2) are replaced by estimates. There are of course other ways to 
define a standardized forecast. 

Strategies now have two parameters. One controls the commencement 
of trades (Ку) and the other their conclusion (k;). Each day begins with a 
market position: none, long or short. Then kis calculated and the position 
is reconsidered. 

Strategy l is an alternative to buy and hold intended to save money when 
a substantial downtrend is predicted: 


(la) Buy on the first occasion that k is more than k,. 
(Ib) When a long position is held: 

do nothing if k 2 k, 

sell and take no market position if k < k3. 
(Ic) When no position is held: 

do nothing if k < k,, 

buy if k > А. 


We might consider k, = 0 and k, = —2. Buy and hold is evaluated Бу 
setting k, = —1000 and k, = —1001. 

Strategy ll is an alternative to risk-free investments. It aims to profit from 
substantial trends in either direction: 


(Ha) When no position is held starta trade if k > k (buy) ork < —k, (sell). 
(ПЫ) When a long position is held: 

do nothing if k > k, 

sell and take no market position if k < kz. 
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(Пс) When a short position is held: 
do nothing if k x —k,, 
buy and take no market position if k > —k,. 


We might consider k, = 2 апа k, = 0. 

For both strategies trades have never been started during the final 15 
days considered for a contract. If necessary, trades have been concluded 
on the final day of the month preceding the contract's delivery month. 


Assumptions 


The list given for the theoretical analysis is revised to make the results 
realistic. The following assumptions reflect various constraints encoun- 
tered by investors. 


(i) Some investors think they know the stochastic process generating 
prices. They possess imperfect estimates of the trend parameters 
from which they can calculate k. They can trade without altering the 
path prices take. 

(ii) Instructions to brokers are made after reading yesterday's closing 
price in today's newspapers. Decisions then have to be evaluated at 
the next closing price, in effect one day late. For example, at 9 a.m. 
the investor has some market position, then reads the price at 4 p.m. 
on the previous day, then calculates k, then has any changes in 
market position made later at 4 p.m. Assuming such a long delay is 
pessimistic but does avoid assuming investors act upon information 
instantaneously. 

(iii) All quantities traded are chosen to make the value of the goods 
bought or sold at the beginning of a trade, denoted Q, equal to the 
capital available for investment, denoted C. Keeping Q constant, 
irrespective of the current price and resources, ensures that the 
trading strategy is less risky than buy and hold. 

Fixing Q at a number different to C merely selects a different 
combination of expected final capital and risk. For example, doubling 
Q will double all cash flows and therefore double (a) the difference 
between expected final capital and the wealth from risk-free invest- 
ments and (b) the standard deviation of the final capital. Note that the 
number of lots traded will not bean integer. (Bankruptcy could occur 
in highly unfortunate circumstances but never has in the calculations 
performed.) 

(iv) Trading costs equal to cQ are paid when a trade worth Q is begun. 

(v) An initial margin deposit dQ is sent to a broker. Should the investor's 
balance fall below eQ then the balance is restored to dQ by sending 
another deposit. Brokers do not pay interest. 
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(vi) Every agricultural or simulated futures contractis traded for one year, 
every currency contract for six months. 

(vii) All money not held by the investor's broker is kept in a bank account. 
It earns simple interest at some rate r per trading day which is credited 
to the account at the end of each trading period, see (vi). 

(viii) If a trade ends on day t the broker returns deposits plus any trading 
gains minus any trading losses on day t + D.. 

(ix) The bank permits immediate withdrawals but then deducts the 
interest earned on the amount withdrawn during the preceding D; 
days. 

All subsequent calculations use D, = 3 and D, = 5, the figures applica- 

ble to UK investors in 1982. The trading constants for the London agricul- 
tural futures have been set at 


c = 0.01, d=01, апа е = 0.05 
for comparison with 
с = 0.002, а= 0.04, апа е = 0.02 


for the Chicago currency futures. Thus further margin deposits аге 
required if half of the original deposit is lost. 

UK interest rates were set at 10 per cent per annum for simulated prices 
and the first part of the real price series, increasing to 14 per cent for the 
second part. US rates were taken to be 12 per cent before 1979 and 16 per 
centthereafter. 


Notes on objectives 


The parameters of the trading strategies are chosen to maximize the 
expected value of an investor's capital at the end of one year, subject to the 
constraint described by assumption (iii). The derived strategies are not 
expected to be mean/variance optimal or to maximize expected utility, 
although the strategies may well be close to optimal. Another objective is 
to make the efficient market hypothesis appear dubious when it is false. 


Hopefully the strategies are as good as alternatives concerning this objec- 
tive. 


8.8 TRADING SIMULATED CONTRACTS 


To operate the trading rules we need values for A, р, Ку, and k,. The first 
part of each futures series, called the calibration contracts, is used to 
estimate A and p. Then k, and К, are selected by optimizing trading results 
upon simulated contracts and the calibration contracts. Simulation is 
employed to avoid relying on the relatively short historical record defined 
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by the calibration contracts. Only when A, p, k,, and k, are all chosen do 
we consider trading results for the remaining contracts. These will be 
called the test contracts. 

Every simulated contract has 310 prices generated by a product process: 
the non-linear trend model of Section 7.2. Simulated returns X, are defined 
by u + V,U, with (log (V,)} Gaussian (mean а, variance 82, autocorrela- 
tions ó”) and {U,} also Gaussian (mean 0, variance 1, autocorrelations 
Ap’). The first 60 prices are only used to obtain initial values and then 
trading is simulated for one year. 


Commodities 


The calibration contracts are those used to test the random walk hypothesis 
in Taylor (1980). Buy and hold was highly successful for these contracts and 
thus defines the commodities benchmark strategy. 

Sugar contracts from 1961 to 1973 are compatible with the simulation 
parameters u = 0.0003, a = —4.25, B = 0.56, ф = 0.98, A = 0.04, and 
p = 0.95. Four hundred contracts have been simulated. The hypothetical 
investor began each year with £10000 and ended it with a final capital 
dependent оп k, and k,. Risk-free investments return 10 per cent per 
annum in the simulations, buy and hold averaged 26.3 per cent andthe best 
version of strategy | averaged 31.7 per cent. 

Table 8.2 shows the averages and standard deviations of the annual 
changes in capital for buy and hold and strategy | when the hypothetical 
investor correctly estimates А and p, and uses k, = 0 and k, = 1. The 
averages show buy and hold obtaining a 16.3 per cent risk premium and 
trading adding a further 5.4 per cent to capital. Since the annual standard 
deviations are very approximately twice the figure for individual stocks the 
high premium is realistic. The standard deviations are very high and this is 
why so many contracts need to be simulated. Standard errors for the 


Table 8.2 Summary of annual percentage changes in capital for simulated prices 


Benchmark strategy Trading strategy 
Mean _ St.dev. Mean St. dev. 
(%) (%) k, k, (%) (%) 
Parameters based upon 
Cocoa 58.9 61.6 0 —1.8 60.0 59.9 
Coffee 43.5 75.9 0 -1.4 49.7 72.5 
Sugar 26.3 62.0 0 -1.0 31.7 58.3 
Currencies 12.0 0 0.8 —0.3 16.2 10.2 


Reproduced from Taylor (1983, p. 188) by permission of Basil Blackwell, Oxford. 
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Figure 8.3 Boundary between efficient and inefficient markets for simulated 
returns. Reproduced from Taylor (1983) by permission of Basil Blackwell, Oxford 


averages are the standard deviations divided by 20, i.e. about 3 per cent. It 
сап be seen that strategy l is less risky than buy and hold, using the standard 
deviation as a risk measure. 

Even supposing trend models generate prices, investors would have to 
take decisions using imperfect estimates of A and p. This is not too 
important. Sugar prices simulated using different values for A and p to 
those assumed by an investor permit trading profits for a considerable 
range of trend parameters. North-east of the curve LL} on Figure 8.3 
trading gives better results than buy and hold if an investor selects 
A = 0.04, p = 0.95, k, = 0, and k, = —1. 

Further simulated prices have been obtained using the cocoa and coffee 
calibration contracts from 1971 to 1976. For cocoa, u = 0.0014, œ = —4.18, 
В = 0.45, ф = 0.987, A = 0.044, and p = 0.933 gave an estimated trading 
gain of 1.1 per cent per annum. For coffee, и = 0.0011, a = —4.39, 
B = 0.83, ф = 0.9, A = 0.12, and p = 0.833 provided an estimated gain of 
6.2 per cent. Table 8.2 summarizes the optimal results for 400 simulated 
contracts. 


Currencies 


The first calibration contract for each currency is the June 1974 and the last 
one is the December 1978. Ending with the December 1978 contract was, 
inevitably, an arbitrary decision. There are no good reasons for a buy and 
hold benchmark. Thus risk-free investment defines the currencies bench- 
mark strategy. 

All appropriate confidence regions for the currency trend parameters 
are large and encompass similar values. Consequently, common paramet- 
ers are used for all three currencies. Four hundred contracts have been 
simulated using и = 0, a = —5.34, B = 0.46, ф = 0.92, А = 0.028, and 
p — 0.95. Table 8.2 summarizes the results for the best trading constants, 
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k, = 0.8 and k, = —0.3. Strategy li then increases the average final capital 
to 4.2 per cent more than the interest rate. Both the premium from trading 
and the annual standard deviation are far less than equivalent figures for 
stocks. 


8.9 TRADING RESULTS FOR FUTURES 


Calibration contracts 


All trading results for the calibration contracts will be favourably biased, 
because A and p have been estimated from their returns. The following 
estimates are based on calibration returns and used for both calibration 
and test contracts. 


A p 
Cocoa 0.0440 0.933 
Coffee 0.1220 0.833 
Sugar 0.0395 0.952 
Currencies 0.0280 0.950 


Various values of к, and К, near to the optima estimated by simulation have 
been considered and final choices for the test contracts have been made by 
considering trading results from simulated and calibration contracts. It 
would not be a good idea to rely on the calibration results alone, for the 
averages for a few years can vary haphazardly when small changes are 
made to k, and kp. 

Sugar results from 1961 to 1973 were much better for k, = 0.4 and 
k, = —1.0 (trading gains 8 per cent per annum) than for k, = 0 and 
k, = —1.0 (trading gains nothing), although both k, were equally success- 
ful for simulated prices. Consequently, the final choice was k, = 0.4. Buy 
and hold was very profitable for cocoa and coffee trading from 1971 to 1976 
so it is not surprising that strategy І gave uninspiring results. The cautious 
choice k, = —1.6 was made. Currency trades were consistently more 
profitable than risk-free investments and the results could notbeimproved 
by small changes to the trading constants chosen previously. 

Table 8.3 summarizes the final selections for k, and k; and the con- 
sequent average annual trading gains, if it had somehow been possible to 
select A, p, Кі, and k; before seeing the calibration prices. The buy and 
hold results are obtained by supposing k, = —1000 and k, = —1001. Trad- 
ing gains are defined as the change in capital using a trading strategy minus 
the change in capital using the appropriate benchmark strategy. 


Evidence against the efficiency of futures markets 217 


Table8.3 Average annual percentage changes in capital 


Benchmark Trading 
strategy Trading strategy gains 
Average Average Average 
(%) k, k, (%) (%) 
Calibration contracts 
Cocoa 61.5 0 —1.6 61.6 0.1 
Coffee 52.3 0 —1.6 49.3 -3.0 
Sugar 26.2 0.4 —1.0 34.3 8.1 
Sterling 12.0 0.8 —0.3 17.6 5.6 
Deutschmark 12.0 0.8 —0.3 20.0 8.0 
Swiss franc 12.0 0.8 —0.3 21.6 9.6 
Test contracts 
Cocoa 19.5 0 —1.6 21.8 2.3 
Coffee 10.5 0 —1.6 12.1 1.6 
Sugar 64.4 0.4 —1.0 91.5 27.1 
Sterling 16.0 0.8 —0.3 23.4 7.4 
Deutschmark 16.0 0.8 —0.3 22.8 6.8 
Swiss franc 16.0 0.8 -0.3 22.8 6.8 


Reproduced (with revisions) from Taylor (1983, p. 190) Бу permission of Basil Blackwell, 
Oxford. 


Test contracts 


The second part of Table 8.3 shows the average annual changes in capital 
for the test contracts. Comparing averages for the trading strategies with 
corresponding averages for the benchmark strategies gives the final 
column of the table. All of these average annual trading gains are positive, 
ranging from 1.6 to 27.1 per cent. These average gains are our major 
evidence for discussing the validity of the efficient market hypothesis. 
They certainly suggest some important inefficiencies have occurred in past 
years. 

Table 8.4 lists the annual percentage changes in capital for the eighteen 
agricultural contracts. Changes are given for buy and hold and strategy ! 
along with the number of days futures were owned. The final column gives 
the number of round trips (one buy and one sell order) for strategy l. The 
assumed efficient market policy, long in futures all year and earning 14 per 
cent interest per annum on money not needed for margin deposits, was 
slightly successful for cocoa, slightly unsuccessful for coffee. Strategy ! 
gave results better by 2 per cent per annum on average for both these 
commodities. Trading results are similar to buy and hold results because k, 
was extremely low. 
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Figure 8.4 Prices and trading decisions (Strategy l) for the December 1976 sugar 
futures contract from December 1975 to November 1976 


The price of the December 1974 sugar contract soared from £90 in 
December 1973 to £560 at the end of November 1974. Buy and hold then 
recorded a massive profit matched almost exactly by the trading strategy. 
Prices declined during the following four years when the trading strategy 
owned futures less than half of the time thereby avoiding buy and hold's 
losses. Buy and hold has a high average return for sugar but strategy l is 
even better, by an impressive 27 per cent per annum. It appears that the 
sugar futures market was inefficient. The conclusions for cocoa and coffee 
futures are less clear. 

Figure 8.4 shows the trading decisions for one of the more successful 
sugar contracts (December 1976). Trades are considered from December 
1975 to November 1976. Prices before this period are shown to the left of 
the dotted line and were only used to obtain initial values for the forecasts. 
Buy and hold would have lost money, buying at £160.2/tonne and selling at 
£121.2. Strategy l, however, would have made money by buying at £166.5 
and selling 152 trading days later at £187.7, after the upward trend had 
ended. No further positions would have been taken. 
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Table8.4 Annual percentage changes in capital for London futures 
Buy and hold with 
interest Strategy! 
Change Days Change Days Round 
(%) owned (%) owned trips 
Cocoa 
1977 121.8 252 121.8 252 1 
1978 1.9 251 —0.5 192 3 
1979 — 17.6 252 —18.0 196 4 
1980 —28.1 250 —21.6 190 3 
1981 19.5 251 27.1 186 3 
Average 19.5 21.8 
Coffee 
1977 -2.5 252 3.6 209 4 
1978 17.7 251 9.4 227 3 
1979 52.2 252 59.8 229 3 
1980 —31.3 252 -33.2 198 5 
1981 16.6 250 20.8 217 4 
Average 10.5 12.1 
Sugar 
1974 535.5 250 537.0 249 1 
1975 —57.4 252 13.5 79 2 
1976 —13.6 254 25.8 152 1 
1977 —20.8 252 5.7 62 1 
1978 —13.2 252 12.8 84 2 
1979 47.5 250 32.7 165 4 
1980 63.2 270 100.2 187 2 
1981 —26.4 251 4.4 106 2 
Average 64.4 91.5 
Overall average 37.0 50.1 
(18 contracts) 


Results before 1981 reproduced from Taylor (1983, p. 191) by permission of Basil Blackwell, 


Oxford. 


Table 8.5 presents the percentage changes in capital over six months for 
trades in each of the eighteen currency contracts. Changes are given for 
risk-free investment at 16 per cent per annum and for strategy 11. Trading 
gains during 1979 and 1980 are, on average, negligible which caused me to 
conclude earlier that the results are consistent with efficient pricing 
(Taylor, 1983, 1985). However, all six 1981 contracts record important 
trading gains. These gains should certainly be viewed as raising consider- 
able doubts about the efficiency of the currency futures markets. IMM limit 
rules would have had no adverse consequences. The gains would be 
higher if the margin deposits could earn interest. 
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Table8.5 Semi-annual changes in capital for Chicago futures 


Risk-free 
Investment Strategy Il 
Change Change Days Round 
(96) (96) owned trips 
Sterling 
June 1979 8.0 9.4 77 2 
Dec. 1979 8.0 13.1 89 2 
June 1980 8.0 12.1 117 3 
Dec. 1980 8.0 7.2 100 3 
June 1981 8.0 15.2 90 3 
Dec. 1981 8.0 13.1 96 5 
Average 8.0 11.7 
Deutschmark 
June 1979 8.0 7.1 86 3 
Dec. 1979 8.0 5.1 77 3 
June 1980 8.0 9.2 92 2 
Dec. 1980 8.0 11.5 86 3 
June 1981 8.0 23.7 105 2 
Dec. 1981 8.0 11.6 101 3 
Average 8.0 11.4 
Swiss franc 
June 1979 8.0 3.3 93 4 
Dec. 1979 8.0 4.9 79 3 
June 1980 8.0 12.5 89 2 
Dec. 1980 8.0 5.0 75 4 
June 1981 8.0 22.0 103 2 
Dec. 1981 8.0 20.7 98 3 
Average 8.0 11.4 
Overall average 8.0 11.5 
(18 contracts) 


Figure 8.5 shows the trades made for the most successful contract 
(deutschmark, June 1981). Prices before December 1980, shown to the left 
of the dotted line, were only used to obtain initial values. Futures would 
have been sold at 0.5306 $/DM and the trade closed 69 trading days later at 
0.4877, then another short position started at 0.4741 and closed out at 
0.4303, at the end of May 1981, 36 days later. 

The trading gains for the test contracts are surprisingly good compared 
with the gains expected after analysing the simulated and calibration 
contracts. Average increases in annual capital are shown in the table 
opposite. The averages in the final row are the arithmetic means of the six 
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Figure 8.5 Prices and trading decisions (Strategy II) for the June 1981 deutschmark 
contract írom December 1980 to May 1981 
Expected increase Actual increase 
Simulated Calibration Test 
contracts contracts contracts 
(%) (%) (%) 
Сосоа 0.7 0.1 2.3 
Coffee 5.9 —3.0 1.6 
Sugar 5.5 8.1 27.1 
Sterling 4.2 5.6 7.4 
Deutschmark 4.2 8.0 6.8 
Swiss franc 4.2 9.6 6.8 
Average 4.1 4.7 8.7 


(Reproduced, with revisions, from Taylor (1983) by permission of Basil Blackwell, 


Oxford.) 
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Table 8.6 Annual profits for a £100 000 trading fund invested in cocoa, coffee, and 


sugar futures 
Buyandhold Strategy | 
Gross Costs Interest Net Gross Costs Interest Net 
Year (£) (£) (£) (£) (£) (£) (£) (£) 
1977 22513 -1000 11357 32860 33410 -2273 12597 43734 
1978 —6407 -1000 9573 2166 -1757 -2363 11380 7 260 
1979 17070 —1000 11310 27380 16463 —3630 11980 24813 
1980 —8797 —1000 11060 1263 5640 —3260 12747 15127 
1981 —5633 -1000 9873 3240 8050 -2643 12013 17420 
Average 3749 -1000 10635 13384 12361 —2834 12143 21670 


Note: Risk-free interest rate 14 per cent p.a. 


figures in the columns. Even after excluding the sugar results the average 
actual increase exceeds the average expected. Thus there is very strong 
evidence that the general pattern of dependence persists for many years 
allowing successful trading rules. 

These futures markets are surely inefficient. No tests of this claim are 
presented since none are known. 


Portfolio results 


Consider a commodity trading fund beginning each year with £100 000 to 
invest. Suppose one-third of the fund is used to trade each of cocoa, 
coffee, and sugar. Then for our assumed interest rate a buy and hold policy 
would have returned 13.4 per cent per annum on average, risk-free 
investment 14.0 per cent, and strategy 1 21.7 per cent on average. Table 8.6 
presents profits earned by a hypothetical fund in each of five years. Gross 
profits are profits from taking positions in futures when trading costs are 
ignored, net profits are gross profits minus trading costs plus bank interest. 
Comparing strategy | with buy and hold, the strategy makes an extra £8612 
gross, earns £1508 more interest and pays an extra £1834 for commission 
etc. to give an extra £8286 net, all figures being annual averages for a 
£100 000 fund. 

Likewise consider a $100000 currency trading fund, invested equally in 
the three currencies investigated. Then strategy 11 would have returned 
23.0 per cent per annum on average, 7 per cent above the assumed 16 per 
cent interest rate. Even allowing for a higher rate on long-term cash 
deposits a fund would have returned attractive profits. Table 8.7 gives 
details for each of three years. Compared with risk-free investment, 
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Table8.7 Annual profits for a $100 000 trading fund invested in currency futures 


Risk-free investment Strategy Il 

Net Gross Costs Interest Net 
Year ($) ($) ($) ($) ($) 
1979 16 000 313 —1163 15 190 14340 
1980 16 000 5173 -1137 15130 19166 
1981 16000 20930 —1150 15 697 35477 
Average 16 000 8805 —1150 15 339 22 994 


strategy ІІ makes $8805 gross by trading, earns $661 less interest and has 
trading costs $1105 to give an extra $6994 net, these figures being annual 
averages for a $100000 fund with interest only paid once a year. 


8.10 TOWARDS CONCLUSIONS 


There is ample evidence in the previous section against the efficiency of 
futures markets. Trading rules constructed from price-trend models are 
successful when applied to the test contracts of all six futures series. The 
rules could have been used to obtain trading profits in excess of efficient 
market predictions in years subsequent to those used for their develop- 
ment. Less sophisticated trend-seeking rules might also have been able to 
earn excess profits. 

Although the efficient market hypothesis has not been proved false, it is 
far more likely to have been false than true for the markets and years 
investigated. The hypothesis cannot be tested until someone describes the 
statistical evidence sufficient to reject it. This task may never be 
accomplished. 

It is obviously difficult to estimate the value of the trading strategies 
described. After analysing prices in 1981 and earlier years, an expected 
annual return 4 to7 per cent above the expected return from either buy and 
hold or risk-free investment appears possible when the value of futures 
positions equals the capital invested. Better trading rules exist, offering 
higher expected returns forthe same levelof risk. Trading results from 1982 
onwards have yet to be calculated. 

Measuring the risk of the trading rules is problematic. Some investors 
and fund managers are likely to consider the risk acceptable, especially if 
they can combine trading with holding a diversified share portfolio. 

Trading rule research is not easy. There are no formal hypothesis tests. 
Further complications arise because the issue of risk premiais crucial when 
the efficient market policy is defined for comparisons with trading variants. 
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Furthermore, many years of prices are required to give any chance of clear 
conclusions. 

Very little autocorrelation is needed to make a market inefficient provid- 
ing it occurs at several lags. It appears markets can be inefficient when all 
autocorrelations (excepting lag 0) are between 0 and 0.03. An important 
statistic for assessing efficiency is the total of the autocorrelations, sum- 
ming over all positive lags. For price-trend models a total above 0.3 might 
be sufficient for inefficiencies, whilst 0.6 almost certainly indicates an 
inefficient market. The latter total is achieved if the trend component 
accounts for 3 per cent of the variance of returns and the mean trend 
duration is 20 days. Low totals suffice because trading costs are relatively 
small. 


8.11 SUMMARY 


Enterprising investors courageous enough to deviate from efficient market 
investment principles would have been rewarded by positive net excess 
returns if they had followed trend-seeking trading rules. Trading any of 
cocoa, coffee, sugar, sterling, deutschmark, and Swiss franc futures would 
have been profitable, even after making realistic assumptions about trad- 
ing costs and the availability of information. Average net excess returns 
range from 2 to 27 per cent per annum and the consistent gains are unlikely 
to be the result of remarkably good luck. 

Theoretical analysis, by mathematical and simulation methods, show 
very little autocorrelation can cause markets to be inefficient. The empirical 
results for practical trading rules strongly support this conclusion. There- 
fore the efficient market hypothesis is probably false for several futures 
markets. Price-trend models play a central role in arriving at this con- 
clusion. 


Chapter 9 -- 


Valuing Options 


The value of an option depends on several variables includingthe variances 
of future returns. This chapter shows how the variance models developed 
in Chapter 3 and the price-trend models of Chapter 7 have important 
implications for option traders. 


9.1 INTRODUCTION 


An American call option is a contract giving its owner the right to buy a fixed 
amount of a specified security at a fixed price at any time on or before a 
given date. The security, often termed the underlying security, could bea 
stock, commodity, or currency, and perhaps a futures contract. Once 
bought an option can usually be sold to someone else before the given 
date, termed the expiration date. Choosing to enforce the contractual 
rights is referred to as exercising the option and the fixed price is termed 
the exercise price. Options which can only be exercised on the expiration 
date are said to be European but are relatively rare, whilst an option to sell 
is called a put option. We only consider call options. An excellent book on 
options has recently been published by Coxand Rubinstein (1985). Another 
informative text is Bookstaber (1981). 

The value of an option depends on obvious variables like the exercise 
price and the expiration date. It also depends on the stochastic process 
governing the price of the underlying security. This short chapter is 
theoretical and discusses consequences of the processes described in this 
book for option traders. 

The next section summarizes the value of a call option when the price 
logarithm changes continuously and at random with the daily returns 
normal and identically distributed. It is based on results by Black and 
Scholes (1973) and Merton (1973). The Black-Scholes formula uses the 
standard deviation of returns and estimates of this statistic are discussed in 
Section 9.3. More appropriate stochastic processes are considered in 
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Sections 9.4 and 9.5, in the first case having changes in conditional 
variances and, in the second, trends in prices. It is shown that option values 
are sensitive to the specification of the process generating returns. 


9.2 BLACK-SCHOLES OPTION PRICING FORMULAE 


Consider a call option on one share of a stock which pays no dividends 
before the option's expiration date. American and European calls have 
identical values when there are no dividends since itis then sub-optimal to 
exercise a call before the expiration date. Let t denote the present time and 
t + Tthe expiration time with T measured in trading days. Also let S = z, be 
the stock's present price and let K be the exercise price. Thus the value of 
the option on day t + Tis Zr — K if this is a positive number, otherwise 
the value is zero. Suppose there is a constant rate of interest rsuch that $1 
invested now is certain to be worth $exp(rT) after T trading days. 

Black and Scholes (1973) assumed the price logarithm followed a con- 
tinuous-time Wiener process: daily returns then have independent and 
identical normal distributions. They demonstrated that the value C of a call 
option depends on K, S, T, rand the standard deviation ø of daily returns. 
Their formula used the cumulative distribution function of the standard- 
ized normal distribution, 


Ф(х) = Qm)? || e 24 qu, 


to give the fair call price 

C = Só(d) — Ke" (d — c VT) (9.2.1) 
with 

d = (log (S/K) + (r  ào?)Ty/(o V T). (9.2.2) 


These formulae follow from the arbitrage properties of a risk-free hedged 
position containing one share of stock balanced by selling short 1/Ф(а) 
calls. As time progresses d changes and thus the hedge should, theoreti- 
cally, be continuously revised. There will be no risk if this can be done, for 
changes in the value of stock will equal changes in the value of the options 
plus interest on the equity invested in the hedge. 

Equation (9.2.1) is the solution of a second-order differential equation 
implied by risk-free hedges. An alternative derivation is helpful for motivat- 
ing results later on. Every hedge and thus every call price is independent of 
investors' risk preferences. Call prices can therefore be obtained by 
assuming stocks and options are traded in a risk-neutral world. Then the 
call value at time t is the discounted expected value at time t + T: 
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C= en (z — K)f(z) dz (9.2.3) 
K 

with f(z) the density function of the price Z,,; when daily returns are 

distributed as N(r — 207, o?) to ensure ҒІ2,;11 = z, exp (rT). To show 

(9.2.1) and (9.2.3) are equivalent we need the following results for the 

density f(z) of a variable whose logarithm is distributed as №, a2): 


| f(z) dz = ®{(u, — log K)/a,} (9.2.4) 
K 
and 
| zf(z) dz = еш? p, + oi — log Ko}. (9.2.5) 
K 


To obtain (9.2.1), let ш = log (S) + (r — 202) Тапа let o, = e VT. 

Black (1976) has shown how to value a call option on a futures contract. 
Assuming zero equity is invested in futures, E[Z,,;] is z, in a risk-neutral 
world. Changing ш; to log (S) — 3To? gives, for European calls, 


С = e "[S@(d) — кФ(а- cV T)] (9.2.6) 
with 
а = (log (S/K) + àc?TV(o VT). (9.2.7) 


All these formulae ignore transaction costs and taxes. The stock formula 
assumes stocks cannot be traded on margin. Generally it is also assumed 
that the price generating process operates over exactly the same periods 
that interest can be earned. For exact results when this assumption is 
replaced by something more realistic see French (1984). 


9.3 EVALUATING STANDARD FORMULAE 


We know that daily returns over long periods of time are not independent 
observations from the same normal distribution. However, it may be 
considered acceptable to make these assumptions for the remaining life of 
an option which will usually be less than six months and perhaps only a few 
days. The Black-Scholes formula then gives fair prices which need not be 
identical to market prices. To calculate C we want c but must instead 
substitute some estimate с into an option pricing formula. Only the 
standard deviation needs to be estimated. 

Some authors advise estimating о by the standard deviation of twenty or 
more recent returns. This appears logical yet the forecast v,,, investigated 
in Chapter 4 may be a better estimate. The forecast attaches greater 
importance to the most recent returns and the weighting scheme can be 
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optimized by considering forecasting accuracy. High and low prices have 
considerable potential to improve the accuracy of standard deviation 
estimates, see Parkinson (1980) and Garman and Klass (1980) for theoretical 
results and Beckers (1983) for some empirical evidence. More research into 
highs and lows is necessary and it should be possible to use them to 
produce an estimate defined like V,,, yet having far greater accuracy. 
Standard deviations of different securities will be interrelated and will 
probably change together, thus a better ó may be obtained by studying 
several securities. 

There are many ways to estimate с and some combination of estimates 
may well be preferable to simpler estimates. The forecasting methodology 
described in Chapter 4 offers a convenient way to compare the accuracy of 
standard deviation estimates and combinations thereof. 

Of course, options traders may be able to use information other than 
past returns to estimate a. Indeed, options markets could be efficient. 
There will nearly always be some o which makes the market price equal to 
the Black-Scholes price and this ø is called the implied standard deviation. 
Relevant empirical work includes Latane and Rendleman (1976), Chiras and 
Manaster (1978), and Beckers (1981b). A suitable combination of the 
implied standard deviation and estimates from past returns will probably 
provide the best estimate of o. 


9.4 CALL VALUES WHEN CONDITIONAL VARIANCES CHANGE 


Assuming daily returns are created by a strict white noise process is not 
very appealing. We have seen this assumption is a serious over-simplifica- 
tion for long series (Section 2.10) and this is also true within the lifetime of 
afutures contract (Section 4.4). Fischer Black, among others, has suggested 
variances regress over time towards some value (Cox and Rubinstein, 1985, 
p. 280). The lognormal, AR(1), product process introduced in Section 3.5 
has this property: the conditional variance tomorrow is expected to be 
closer to a median value than today's conditional variance. It was con- 
cluded in Section 4.5 that this type of stationary process was acceptable for 
modelling some but not all series of returns. We discuss call values firstly 
for a stationary process and later remove this assumption. The returns are 
assumed to have conditional normal distributions and to be uncorrelated, 
tis the present time and the call expires at time t + T. 


Formulae for a stationary process 


Suppose the conditional standard deviations М, V,,,, ..., Vis; are gener- 
ated by the Gaussian process 


log (Van) — e = {log (Vas) — а) + m (9.4.1) 
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for 1 < h < T with 0 < ф < 1. Also the unconditional distribution of 
log (V,,4) is N(a, B°), the unconditional variance of V,.,, is 


сё = exp (2a + 28?) 


and the m, are independently distributed as N(0, 8711 — ¢7]). Given a 
realization v, of V, suppose the return X, has distribution №, v?) for some 
constant м. To obtain a continuous-time stochastic process, say 
(Z(s), s real), we could assume the price logarithm follows a Wiener 
process during each trading day with 


log {Z(t + h + c + dYZ(t + h + o) ~ Миа, vind), 


given an observable realization v,,, and any c, dhavingO0 < c < c + d < 1. 

We now revise the purely theoretical arguments used to obtain the 
Black-Scholes value of a call option. Theoretically, perfect investors could 
quickly calculate v,,, from the continuous price record and thus could 
create risk-free hedges at all times. These hedges will not depend on risk 
preferences and therefore fair call prices can be obtained by assuming a 
risk-neutral world. Thus equation (9.2.3) is again appropriate and we need 
the density function of the price Z,,; on the option's expiration date. 

At time t we know the current price z, and will assume v, is also known. 
Given this information, 


; 
log (Zr) = log (z) + У Xan 
h=1 


has conditional mean ш; = log (z) + Tu and conditional variance 


2 
от = У) F[V I, |w] (9.4.2) 
h-1 
because the returns X,,,, are uncorrelated and E[X2,,|v,] = E[V2,,|v,]. To 
evaluate (9.4.2) we apply the AR(1) definition, (9.4.1), to give 


log (V..&|v) ~ Ма + "ilog (v) — а], n — ф?”]) 
and hence 


E[V2,,|v,] = exp {2a + 2ф Шор (м) — а] + 282[1 — $?^]). 
(9.4.3) 


It is then possible to calculate a2. 

To find the value of иіп a risk-neutral world it is necessary to assume the 
total X, ,, + =} + Жут has a normal distribution for any given v,. This is not 
true for T > 1 but should permit an excellent approximation to correct call 
values. Making the assumption, ші r — 304/T for stocks in a risk-neutral 
world and note this depends on v,. Applying the normal assumption a 
second time it can be deduced that a call value for a product process can be 
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Figure 9.1 Fair values for in-the-money call options (K/S — 0.9). Values from the 
product process formula (solid curve) and the Black-Scholes formula (dotted 
curve) plotted against the present standard deviation 


obtained by the following modification of the Black-Scholes formula, 
(9.2.1), (9.2.2): 


h-i 


T 1⁄2 
replace a VT by o, = |> вим) А (9.4.4) 


To make the replacement use (9.4.3) and, in practice, v,, а, B, and ó will 
have to be replaced by estimates. The same revision is applicable to stocks 
and futures. 


Examples 


Comparisons between the product process and Black-Scholes formulae 
are given for stock options four months from expiration (T = 84 trading 
days) with a 10 per cent annual risk-free interest rate (R = log [1.1]/252). For 
the product process | have supposed ф = 0.985, B = 0.5, and an overall 
volatility sd(X,) = оу = ov = 0.02, hence a = —4.16, in line with the esti- 
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Figure 9.2 Fair values for at-the-money call options (K = S) calculated from the 
product process formula (solid curve) and Black-Scholes formula (dotted curve) 


mates presented in Tables 3.2 and 3.3. A present stock price of $100 (=S) is 
considered and options with exercise prices $90, $100, or $110 (=K). 

Figures 9.1,9.2, and 9.3 illustrate call values for the three exercise prices. 
The solid curves represent CPP(v,), the call value according to the product 
process formula when the latest conditional standard deviation is у,. The 
dotted curves show CBS(o), the Black-Scholes value when all standard 
deviations are о. All curves are plotted for the approximate 95 рег cent 
range for V,: exp (a — 28) < v, < exp (а + 28). 

The product process formula provides a higher value than the Black- 
Scholes formula for low current standard deviations v,, i.e. CPP(v,) then 
exceeds CBS(v,), and vice versa for high v,. This is to be expected. If v, is 
lower than the median level exp (a) than v,,,, is expected to increase as h 
increases and the anticipated additional future volatility makes the call 
more valuable. The percentage difference between CPP(v,) and CBS(v,) is 
particularly large for low v, and the out-of-the-money option (K — 110, 
Figure 9.3). It is also large, but to a lesser degree, when v, is high for the 
same option. 
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Figure 9.3 Fair values for out-of-the-money call options (K/S = 1.1) calculated 
from the product process formula (solid curve) and Black-Scholes formula (dotted 
curve) 


Table 9.1 Fair call option values for Gaussian white noise and various product 


processes 
Conditional Product process 
standard Black- 
deviation v, K Scholes ф = 0.98,0.985,0.99,0.995 о, = 0.01,0.03, 0.05 
0.0094 90 13.01 13.91 13.72 13.49 13.24 13.01 13.05 13.13 
100 5.19 7.14 6.81 6.38 5.83 5.20 5.31 5.56 
110 1.20 3.01 2.69 2.28 1.78 1.22 1.31 1.53 
0.0156 90 14.20 14.74 14.65 14.52 14.31 14.02 14.14 14.42 
100 7.30 8.41 8.28 8.08 7.78 7.33 7.52 7.94 
110 3.17 426 4.13 394 363 3.19 338 3.79 
0.0257 90 16.61 16.06 16.21 16.38 16.53 16.64 16.90 17.48 
100 10.87 10.18 10.37 10.57 10.77 10.91 11.23 11.93 
110 6.74 6.04 6.23 6.44 664 678 7.41 7.83 


Notes: Present stock price 5 = 100, option life T = 84 trading days, annual interest rate 10 per 
cent, unconditional standard deviation 0.02 for the product process. 
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Table 9.1 illustrates the call prices for four values of $. These are 0.98, 
0.985, 0.99, and 0.995 and the table gives figures for log (v) = а — B, and 
а + B. It can again be seen that the greatest percentage differences 
between CPP(v,) and CBS(v,) occur for low v, and a high ratio K/S. These 
differences increase as ф decreases. 


Non-stationary processes 


Conditional variances may not regress towards a median value. Instead of 
supposing ф is just less than 1 it may be more logical to consider the 
non-stationary case ф = 1 for some series. All the previous results still 
apply except now 


ElV} , |w] = v; exp (2ho2) (9.4.5) 


with c, the standard deviation of the residual terms in the random walk 
process 


log (Vien) = log (М+һ-1) + Meh- (9.4.6) 
The returns process is now non-stationary: 
var (X,.,)/var (X) = exp (205) # 1, 


assuming о, > 0 and var (Х,) is finite. 

The Black-Scholes formula will always underestimate the true call value 
when (9.4.6) applies, indeed CPP(v) — CBS(v,) is a positive monotonic 
function of о, for all exercise prices. It is not easy to suggest sensible 
values of o; but it is unlikely that a value above 0.05 will be appropriate. 
The right-hand columns of Table 9.1 show CPP(v) with other variables 
(excluding ф) as for the other columns of the table. Again the per- 
centage differences are greatest for the option out-of-the-money 
(K = 110) although the differences for ф = 1 are generally less than for 
ó «1. 


Conclusions 


The stochastic behaviour of the volatility of the underlying security 
is very important when valuing an option. Ignoring possible future 
changes in volatility can give a seriously incorrect call value. So, un- 
fortunately, can using a stationary model when a non-stationary one is 
appropriate, and vice versa. Further research into volatilities is clearly 
desirable. 
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9.5 PRICE TRENDS AND CALL VALUES 


The evidence in Chapters 6 to 8 strongly suggests there have been trends 
in the prices of commodity and currency futures. Informed investors will 
probably consider both an underlying security and its options to be 
mispriced when they believe there are trends in the security's prices. 
Naturally, trends can only continue to exist if enough traders use informa- 
tion inefficiently. 

Price-trends do not exist in the theoretical models used to derive option 
pricing formulae. It is thus difficult to assess the value of an option to an 
investor having superior information about the price process. Trends 
apparently have no consequences for the arbitrage arguments used to 
price options. For example, when prices follow a binomial process, so each 
transaction changes the price from some S to either uS or dS (u and d 
known), then superior information about the probabilities of each possible 
change is irrelevant when pricing a call by arbitrage (Cox and Rubinstein, 
1985, pp. 171-8). Risk-adjusted profits from superior knowledge could 
presumably be obtained from a position in the security, or its options, or 
certain combinations thereof, but not necessarily by arbitrage. It is now 
argued that trading options may then be more profitable than trading the 
security, assumed to be a futures contract. 


A formula for trend models 


Someone who believes there are trends in prices will not believe returns 
are uncorrelated and thus will not believe the variance of the total 
return È Xn = X4 + ++: + Хт is the sum of the variances of individual 
returns X,,,. The person may want to assess options using a variance for 
E Xan based upon a trend model since option values depend on such 
variances. A formula for assessing call options when there are trends is 
discussed soon. A linear Gaussian process is assumed to keep the 
equations simple. It is possible, but complicated, to incorporate changing 
conditional variances into the formula by adapting the methods used in 
Section 9.4. For simplicity again, the formula ignores any information 
summarized by the observed forecast f, of x; 1. Such information might 
more naturally be used to consider trading futures. In contrast, superior 
information about the variance of X X,,, should be used to trade 
options. 

Variances are represented in the Black-Scholes formula for stocks and 
Black's adaptation for futures by the product a VT, which is intended to 
equal the standard deviation of > X,,,,. Better informed traders may want to 
include unhedged calls in their portfolios when the true standard deviation 
exceeds the value believed by other traders, say e VT. To help select 
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Figure 9.4 Values for an at-the-money call option (K = S) calculated from a 
formula that assumes trends (solid curve) and from Black's formula for futures 
(dotted and dashed curves) 


profitable calls it is suggested that calls are assessed by replacing aV T with 
the correct standard deviation for X X,+, in a price-trend model whose daily 
returns have variance c? and autocorrelations Ap”. From (7.5.12), this 
price-trend formula for assessing call options is given by the following 
modification of the pricing formulae given in Section 9.2: 


replace a VT by o, = alT + 2Ap(T — (1— py(1— р)}/(1 — p)’. 
(9.5.1) 


It is emphasized that this change does not provide a pricing formula based 
on risk-free hedges. Instead, the price-trend formula might help to identify 
underpriced calls. Buying such calls should increase the risk-adjusted 
return expected from a sufficiently diversified portfolio. 


Examples 


As in the previous comparisons it is supposed that the present price S is 
$100, the exercise price Kis $90, $100, or $110 and the annual interest rate is 
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Figure9.5 Values for an out-of-the-money call option (K/S = 1.1) calculated from 
a trend formula (solid curve) and Black's formula (dotted and dashed curves) 


Table9.2 Call values when there are trends in prices 


K = 90 K = 100 K = 110 
Standard deviation a T CB? CPT^ CB CPT CB CPT 
0.01 20 9.94 9.97 1.77 2.10 0.03 0.08 
40 9.97 10.19 2.48 3.18 0.19 0.49 
80 10.18 10.90 3.46 4.72 0.67 1.54 
120 10.44 11.57 4.17 5.83 1.16 2.49 
0.02 20 10.42 10.77 3.54 4.20 0.68 1.11 
40 11.19 12.20 4.97 6.36 1.70 2.88 
80 12.52 14.62 6.91 9.42 3.41 5.78 
120 13.59 16.46 8.34 11.62 4.79 8.00 
0.03 20 11.48 12.21 5.31 6.30 1.96 2.80 
40 13.06 14.81 7.44 9.53 3.85 5.84 
80 15.43 18.78 10.35 14.08 6.70 10.43 
120 17.22 21.68 12.47 17.35 8.86 13.82 


*Call value using Black’s formula. 
^Value when the formula is adapted for price-trends. 
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10 per cent. Initially the standard deviation с is assumed to be 0.02. The 
parameters of the trend model are A — 0.03 and p — 0.95 for all the 
comparisons. These values are similar to many of the estimates obtained in 
Chapter 7. 

Figures 9.4 and 9.5 show call values for futures when K is $100 and $110. 
The solid curves represent CPT(7), the call value when the price-trend 
formula is used and T trading days remain until the option expires. Dotted 
curves indicate CB(7), the value given by Black's formula. Both CPT(T) and 
CB(T) assume perfect estimates of ø. Estimates will actually depend on the 
time between prices in a sample. A perfect estimate of the variance over 
one week, о2 = var (log [Z,,s/Z,]), will lead to a different option value 
using Black's formula if a VT is replaced by o; V(T/5). The corresponding 
call values are shown by the intermediate, dashed curves. 

It is apparent that call options priced by the standard formulae might be 
considered bargains by investors convinced there is a trend component in 
returns. Table 9.2 lists values for Black’s formula and the trend formula, for 
о = 0.01, 0.02, and 0.03. Once more the percentage differences are great- 
est for the out-of-the-money option. For K = 1.15, c = 0.02, and 
120 > T > 25, the trend formula gives values 65 to 70 per cent above the 
standard value; the percentage increase when c = 0.01 declines with T to 
114 per cent when 7 = 120, whilst for = = 0.03 it grows with T over the 
range 100 = T > 25 from 46 to 56 per cent. 


9.6 SUMMARY 


A very important determinant of option values is the variance of the total 
return over the remaining life of the option. Estimates of this variance 
should reflect the fact that (conditional) variances will not have been 
constant in the past nor can they be expected to be constant until the 
option’s expiration date. It is quite possible that model-based estimates of 
variance are better than market estimates and there could then be 
economic gains from option trading. Empirical research into this possibility 
should yield interesting results. 


Chapter 10 - 


Concluding Remarks 


It has been shown that describing the statistical behaviour of financial 
prices accurately is a formidable task. This concluding chapter summarizes 
important features of the price generating process, highlights major impli- 
cations for buyers and sellers of financial assets, and notes interesting 
questions for further research. 


10.1 PRICE BEHAVIOUR 


The stochastic process generating daily returns and hence daily prices is 
not like the processes commonly described in statistical texts and taught to 
students. Linear processes cannot be used to model returns. In particular 
returns are not independent observations from some fixed distribution. 
Neither can returns be modelled by a linear combination of the present 
and past terms from such a process, as shown at the end of Chapter 2. 
Returns must instead be modelled by some non-linear stochastic process. 
Additionally, it may be necessary to consider non-stationary processes. 
Relatively little is known about relevant non-linear processes which makes 
the modelling of returns a challenging activity. 

Non-linear behaviour is almost certainly a consequence of changes in 
the variances of returns caused by variations in the general level of market 
activity. This level depends upon the preferences of investors and other 
traders and upon the quantity and importance of relevant information 
about the asset being traded. Progress has been made in Chapters 3 and 4 
towards finding practical models for the variances of returns conditional 
upon appropriate information sets. 

Prices may look like random walks but they are not. Daily returns are not 
uncorrelated observations sharing a common mean, neither are they 
uncorrelated with means fixed by economic theories of equilibrium. These 
conclusions were obtained in Chapter 6 by using appropriate statistical 
tests. Considerable care is required as many popular tests are unreliable 
when the process is non-linear and lack test power when there are trends 
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in prices. Rejection of the random walk hypothesis indicates that some 
information is not reflected accurately by prices on the day it first becomes 
known. The delay is minimal for stocks but appears to be important for 
currencies and commodities. Thus accurate models for the returns from 
many financial assets must be non-linear and possess non-zero auto- 
correlations at some or all positive lags. 

Prices reflect most information efficiently. Some markets, particularly 
for futures, do not appear to be sufficiently efficient to prevent successful 
trading using rules based upon statistical models. There is astrong case for 
trends in prices as a credible alternative to random behaviour for certain 
currencies and commodities. The realistic assessments of trading rules in 
Chapter 8 indicate profitable results were possible during years sub- 
sequent to the data used to define the rules. 


10.2 ADVICE TO TRADERS 


Private and corporate investors, businessmen, primary producers, anyone 
involved in international trade and the brokers and analysts who advise 
these people can all benefit from a deeper understanding of price 
behaviour. Many traders are rightly concerned about the risks associated 
with uncertain changes in prices. These risks can frequently be sum- 
marized by the variances of future returns, perhaps directly or by their 
relationship with relevant covariances in a portfolio context. Variances 
change and within a few weeks or even days they can increase dramatically 
and likewise the risks faced by traders. The forecasts of future standard 
deviations investigated in Chapter 4 can provide up-to-date indications of 
risk, which might be used to avoid unacceptable risks perhaps by hedging. 
These forecasts can be improved (see Section 10.3) and the methodology 
given here allows readers to develop their own improvements. 

There will always be some traders who are best called speculators, 
having neither need of the goods they trade nor intention to possess them 
for long. In so far that speculators accept risks non-speculators wish to 
avoid, speculation can be respectable. Informed speculation might even 
be called investment. Speculators should consider the implications firstly 
ofthe trend models, particularly with regard to futures, and secondly of the 
variance models, particularly with regard to options. 

No responsible adviser would claim profits can be made easily by trading 
financial goods. Markets use information at least reasonably efficiently and 
large profits can only be obtained by successfully enduring large risks. 
Nevertheless, the models presented in this text have the potential to 
modestly increase the returns obtained by sensible speculators. Readers 
wishing to estimate model parameters from their own price series can use 
the computer program given in the appendix. 
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10.3 FURTHER RESEARCH 


Our understanding of financial time series is reasonably complete. There 
are, however, several opportunities for further research. New markets 
continue to be opened and many people will want to analyse their prices. 
Research into further series must recognize the non-linear behaviour of 
returns and must avoid the assumption that returns are independently and 
identically distributed. It may well be necessary to consider the possibility 
that prices do not follow a random walk. 

Forecasts of future conditional standard deviations can almost certainly 
be improved. Daily high and low prices should be used to obtain better 
forecasts and so too should standard deviations implied by option prices 
whenever available. Seasonal effects, such as a higher standard deviation 
for returns encompassing weekends, should be considered. The best 
forecasts can be used for interesting tests of the efficiency of options 
markets. Comparisons of forecasts derived from stationary and non- 
stationary models (as in Chapter 4) should be instructive. Better forecasts 
will be useful when defining rescaled returns for random walk tests. 

Modelling any trend component in returns requires some specification 
of the interdependence (if any) of the trend and the fluctuating variance of 
returns. This is not straightforward, as shown in Chapter 7. Further research 
is desirable and an attempt to find unbiased estimates of the trend 
parameters should be made. Such research has the potential to improve 
the trading rules discussed in Chapter 8. Naturally more can be done 
towards investigating the efficiency of futures markets and further research 
may well be rewarding. Further results about the vexatious issue of 
whether or not risk premia exist for futures would assist such research. 

This book has only discussed models for univariate time series. Multi- 
variate series are defined by several observations made at the same time, 
for example the daily closing prices of two or more financial assets. 
Interesting examples are firstly the prices of several stocks, secondly the 
prices of copper, lead, tin, zinc, and other metals, and thirdly key prices 
like gold, the national stock index, exchange rates with major trading 
partners, interest rates, and the price of oil. Each univariate price series 
defines a series of returns and a set of these series defines a multivariate 
returns series. 

Modelling the interdependence between the returns from different 
assets would be very interesting. The largest cross-correlation between the 
returns X; and x; in series į and j will be contemporaneous if markets аге 
efficient, i.e. when the times s and t are equal. Lead and lag relationships 
(non-zero correlation for some / # j and s # t) must be considered al- 
though any correlations are likely to be small. The cross-correlations 
between absolute returns are important. They should help to summarize 
the interdependence of price volatilities for different assets. Standard 
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deviations for some assets may well depend on the same statistic for other 
assets and lagged relationships are conceivable. Accurate models will 
probably be multivariate generalizations of the univariate models already 
described. It would be easiest to begin by supposing all univariate models 
for returns are uncorrelated and non-linear. Cross-correlations for returns 
and absolute returns might then indicate a suitable category of multi- 
variate, non-linear models. 


10.4 STATIONARY MODELS 


All models are necessarily imperfect and can only be approximations to 
the highly complicated process which produces observed prices. Some 
models, however, are certainly better than others. Stationary stochastic 
processes often appear to be acceptable for daily returns. A summary of 
the best stationary models described earlier concludes our study of 
financial time series. Recall that the price process (Z) responsible for 
generating daily prices indexed by the time variable t is non-stationary, 
whilst the returns process (X,) may well be considered stationary, it being 
defined by X, = log (Z) — log (4,_1). 


Random walks 


The logarithms of prices follow a random walk when returns have identical 
means and are uncorrelated, i.e. when 


E[X] = ш and сог (Х,, X,,) = 0 


for some constant ш and all times t and all positive lags т. Two Gaussian 
processes can then be combined to give a stationary and uncorrelated 
process capable of explaining the major properties of returns. Prices have 
a tendency to change more often on some days than on others and this is 
described by the conditional standard deviation V,. The reaction of the 
price at this level of market activity is summarized by an independent 
random variable U,. An appropriate model is: 


X = u + VU, (10.4.1) 
log (V) — a = o{log (V1) — a} + m, (10.4.2) 


with (a) the U, have independent normal distributions, all with mean 0 and 
variance 1, (b) the m, have independent and identical normal distributions, 
with mean zero, so (log (М,)} is a Gaussian, autoregressive process, and (c) 
the processes (U,) and { V,} are stochastically independent of each other. 

This model was studied from Chapter 3 onwards. There are four par- 
ameters: the mean return и and the mean о, standard deviation В and 
autoregressive parameter ф of the process (log (V,)}. The innovation x, 
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determines V,, the measure of market activity on day t. Small seasonal 
adjustments can be incorporated into the model to make it consistent with 
the day, weekend, and month effects noted in Section 2.5. 


Price trends 


The returns from various commodities and currencies display positive 
dependence. This is consistent with trends in prices. Let u be the average 
return, as before, and let x, — u summarize the effect of slowly interpreted 
information upon the return during day t. The proportion of the variation 
in returns due to trends is denoted by A = var (u)/var (X). Then the best 
generalization of (10.4.1) and (10.4.2) found so far has autocorrelations 


сог (X, Xa) = Ap’, A, р, т> 0 
апа іѕ defined Бу 


X = щш + М0", (10.4.3) 
ш = ш = pua — ш) + À, (10.4.4) 


and (М) as previously, see (10.4.2), with, furthermore: (i) each of the three 
processes 44), (m), and (UT) consists of independent and identically 
distributed normal variables having zero mean, (ii) the three processes are 
stochastically independent and (iii) their variances are var (2) = (1 — $?)8?, 
var (0%) = (1 — A){1 — A + A exp (482), and var (Z) = A(1 — p2)o2 
with 
т? = var (X) = exp Qa + 28?)/(1 — А + A exp (48?)). 

This is the linear trend model defined in Section 7.2 with minor alterations 
to the notation. There are six parameters: A and p determine the average 


magnitude and duration of the trends whilst и, a, B, and ¢ are defined as in 
the random walk model. 


Appendix  .. 


A Computer Program for Modelling Financial 
Time Series 


This appendix describes a FORTRAN program which calculates many of the 
statistics and parameter estimates presented in the main text. The program 
is intended to help readers to model their own series. FORTRAN code is 
presented after a description of what the program does and the data it 
requires. 


Output produced 


The program produces the following information after reading a time 
series: 


(i) Summary statistics for returns, their number n, average x, standard 


(H 


(ii 


(iv 


= 


— 


м 


= 


deviation s, variance $°, test value for zero mean t = \/nx/s, skewness 
band kurtosis k (equations (2.3.1)); also the annual Compound rate of 
return G and the estimated simple annual rate of return A (see notes 
to Table 2.3). 

Numbers of outliers, being the numbers of returns more than k 
standard deviations away from the mean for integers К > 2 (as in 
Tables 2.6 and 2.7). 

The autocorrelations of returns, absolute returns, and squared 
returns, for lags between 1 and 30 trading days (equation (2.9.1) for 
returns and similarly for the transformed series); also the standard 
error for strict white noise, 1/\/n, and the associated 95 per cent 
confidence interval, +1.96/\/n. 

Parameter estimates for the process generating standard deviations: 
estimates i, and Gy of the mean and standard deviation of the process 
(М) (equations (3.8.2)); further estimates given by assuming log (V,) 
has the distribution N(a, B°), namely @ and В (equations (3.9.1) and 
(3.9.2), an estimate for the median of V,, exp (å), and an estimated 95 
per cent interval, from exp (à — 1.968) to exp (à + 1.968). 
Estimates for forecasting absolute returns and conditional standard 
deviations: Ф and 6 for the best stationary forecast, given by suppos- 
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ing log (V) has autocorrelations $* then obtaining В and ф by 
minimizing the function Fç (see Section 3.9) and hence getting ó by 
appropriate substitutions in equation (4.3.2) (these estimates define 
the third forecast given in Section 4.3); also ñ,, given by equation 
(3.9.5). 

(vi) Estimates of the autocorrelation variances, multiplied by the number 
of observations: b, for returns (equation (5.7.1)) and b* for rescaled 
returns (equation (5.8.4)), for lags 1 to 30. 

(vii) The autocorrelations of rescaled returns, for lags 1 to 30 (equations in 
Section 6.8). 
(viii) Random walk test statistics calculated from the autocorrelations of 
rescaled returns: T* (equation (6.6.2)), U* (equation (6.7.1)), r, vn*, 
N,, Оз, Qo, and Qs, (see Section 6.4); also the results when T* and 
U* are calculated from the autocorrelations of returns (Section 6.9). 
(ix) Runs analysis, in particular the runs test statistic K (equation (6.4.8)). 
(x) The estimated spectral density function, namely Қо) defined by 
equations (6.4.4) and (6.4.5) and the standardized statistics f, equation 
(6.4.6). These numbers are followed by the random walk test statistics 
fy (equation (6.6.3)), f, and N, (Section 6.4). 
Estimates of the price-trend model parameters provided by minimiz- 
ing the functions F, and F, (equations (7.3.2) and (7.3.3)). The numbers 
A; „апа S; m are given, then the estimates тї; and A, followed by p; and 
Q, (see Section 7.3). 


(xi 


— 


Computer time required 


The program was tested on a VAX 11/780 computer running under the 
DEC/VAX operating system VMS version V3.5. Compilation and linking 
took 25 seconds of central processing unit (CPU) time. A further 16 seconds 
of CPU time were sufficient to execute the program on one of the US stock 
returns series (2750 observations). 


User-defined parameters 


Users may have to change some of the statements following the first set of 
comments and perhaps numbers in the REAL declaration. 

The identifier G stores the value of y used in the calculations of v, 
(equation (5.8.3)). Replace С = 0.04 by С = 0.1 for commodity and 
currency series. 

For a spot series retain the statement K = 1 as in the program listing. For 
a series made up from futures contracts, K must equal the number of 
contracts. 
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The next two statements define the values of NRFIT and NRM. These 
values are respectively the number of autocorrelations used firstly to 
estimate the trend model parameters and secondly the variance model 
parameters. NRFIT is the integer Kin equation (7.3.2), NRM is the integer K 
in Section 3.9. 

The value of NRSP is one more than the number of autocorrelations used 
in the calculation of the spectral density function. NRSP is the integer M in 
the definition of the Parzen weights (equation (6.4.5)). 

The integer XSIZE equals the size of the arrays X, Y, and SQUARES. Any 
change to the value given in the program will also require changes in the 
final line of the REAL declaration. It is likely that XSIZE should be decreased 
for a futures series, perhaps to 300. 

Note that the dimensions of the matrix Z, defined at the end of the REAL 
declaration, must be at least XSIZE by K. 


Optional parameters 


Changing the values of NRX and/or NVARR in the fourth DATA statement 
will change the number of autocorrelations calculated and/or the number 
of autocorrelation variances estimated. 


Input requirements 


The user must ensure that the following data is read before the comment 
"TAKE LOGS": 


(i) Prices Z(J, 1) with | referring to the contract number and J counting the 
number of trading days since some starting date. | is always 1 for spot 
data. 

(ii) Integers FIRSTY(I) and 1А5ТҮ(І) defining the first and last days for which 
rescaled returns are calculated for inclusion in the autocorrelation 
calculations; | indicates the contract number. 


Two examples will, I trust, make these requirements clear. 

First, consider a series of 2000 spot prices, say from 2 January 1978 to 31 
December 1985 with the price moving up from 120.5 to 216.7. The following 
datafile could be read satisfactorily without altering the program. 


22 2000 2000 
20178 120.5 


2000 lines 30178 12222 


311285 216.7 
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Here FIRSTY(1) is 22 to allow 20 returns to be used to calculate the initial 
standard deviation forecast. Prices 21 and 22 are used to obtain the first 
return appearing in the autocorrelation calculations for random walk tests. 
The other two numbers in the first line are LASTY(1) and NPRICES, the 
number of prices to be read. 

Second, consider a series defined by 20 futures contracts with prices 
from February 1976 to December 1985. Suppose these contracts have 
delivery dates June 1976, December 1976, June 1977, ..., December 1985. A 
satisfactory data file could commence like this: 


22 90 101 
20276 1.675 
90lines 20276 1 Qu 
310576 1.568 First contract 
10676 1.582 
11 lines | : : 
150676 1.591 
66 191 201 
10376 1.612 
65 lines | kaa 1 ы 
310576 1.603 
| 10676 1.597 Second contract 
126 lines : : 
301176 1.431 
11276 1.431 
10lines | i : 
141276 1.415 


There would then be similarly formatted data like the second contract Юг 
the remaining 18 contracts. 

In this example FIRSTY(1) is again 22 to permitthe calculation of essential 
initial values whilst LASTY(1) is 90 and ensures that the June prices for the 
June 1976 contract are not used in the calculations. In the first row of the 
file NPRICES is 90 + 11 = 101 and equals the number of prices for the June 
1976 contract in the file. Note FIRSTY(2) is 66, to make sure the first return 
from the second contract appearing in the time series uses the prices on 31 
May 1976 and 1 June 1976. Twenty returns from the contract before 1 June 
1976 are only used to calculate initial values. Also note LASTY(2) is 
65 + 126 = 191 and the second value of NPRICES is 65 + 126 + 10 = 201. 
It would be necessary to alter the final statement in the REAL declaration 
and the statements defining K and XSIZE, perhaps using 
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4SQUARES(300), TYPE(3), X(300), Y(300), Z(300, 20) 
K = 20 


XSIZE = 300 


About the subroutines 


ACF uses the data X(J1), X(J1 + 1), ..., X(J2) stored in an array of size N to 
calculate the autocorrelations R(1), R2), ..., R(NR) up to lag NR. M denotes 
the size of the arrays C and R. Note the observations must have total 
X(J1) + X(J1 + 1) + ::: + X02) equal to zero (spot data) or total over all 
contracts equal to zero (futures data). Also see the final sentence in this 
subsection. 

FIT uses autocorrelations К(І1),..., R(I2) calculated from N observations 
to find the best trend model estimates MHAT and AHAT constrained by 
MHAT x MMAX (rfi and Á in Chapter 7). 


FITV uses appropriate autocorrelations R(1), ..., R(J1) calculated from М 
observations to find the best estimates BHAT and PHIHAT (B and ó in 
Chapter 3). 


MAPARA finds the moving average parameter Q giving autocorrelations 
АР” when the autoregressive parameter in an AMRA(1, 1) model is P. 

RUNSTEST stores the number of negative runs in S(1), positive runs in 
5(2) and no-change runs in S(3) for a series of returns X(N1), ..., Х(М2). Kis 
the standardized test statistic and E2 is a very small positive number. 

SPECTRAL uses autocorrelations R(1), ..., R(M1 — 2) to calculate spectral 
density estimates SPEC(1), ..., SPEC(M1) corresponding to frequencies 
w = (j — 1)m/(M1 — 1) for j = 1 to M1. The array SPECD gives the corre- 
sponding standardized values when there are N observations. NSIG counts 


the number of significant estimates for the subset] — 1,5,9, ..., M1. 
VARR is used to estimate the variances of the first NV autocorrelations 
with XBAR the average of all the observations stored in Х(Ј1), ..., X(J2), 


averaging over all contracts. 

Subroutines ACF, RUNSTEST, and VARR are called once for each futures 
contract and the results are only meaningful after the whole set of calls has 
been made. 
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Fortran program 


оооооо 


ооо ооо 


ооо 


30 
60 


90 


100 


110 


PROGRAM ABCD 

INTEGER ASIZE,BSIZE,CSIZE,FIRSTX(50),FIRSTY(50),LAST(50), 
10UTS(20,3),SSIZE,XSIZE 

REAL AEST1(100),AEST2(100),B(30), BSTAR(30) , CABS(100) , CM(100), 
1CSQD(100),CX(100),CY(100),FIRSTV(50), MBAR, MINDIF,MSUM,N,NY, 
2RABS(100),RM(100),RSQD(100),RX(100),RY(100), 
3SMIN1(100),SMIN2(100),SPEC(101),SPECD(101),SPECSD(101), 
4SQUARES(3000), TYPE(3),X(3000),Y(3000),Z2(3000, 1) 

DATA ASIZE,BSIZE,CSIZE,SSIZE/100,30,100,101/ 

DATA VM,VX,VY,VABS,VSQD,CM,CX,CY,CABS, CSQD/505*0.0/ 

DATA RUNS,TYPE,MINDIF,N28,DELTA/4*0.0,0.000001,28,0.798/ 
DATA NRX,NVARR,OUTS/30,30,60*0/ 

DATA ABSXSUM,XXSUM,MSUM, XSUM, YSUM, X2SUM, X3SUM, X4SUM/8*0.0/ 
DATA DENX,DENY,B,BSTAR,TX,TY,UX,UY/66*0.0/ 


NUMBERS SPECIFIED BY PROGRAM USER, EXAMPLE FOLLOWS 
MAX. VALUES FOR NRFIT, NRM AND NRSP ARE 100,100 AND 101 
DIMENSIONS OF X, Y AND SQUARES MUST EQUAL XSIZE 
SIZE OF MATRIX Z MUST BE AT LEAST XSIZE BY K 


G=0.04 

K=1 

NRFIT=50 

NRM=50 

NRSP=100 

XSIZE=3000 

G1-1.0-G 

NRY-NRSP 
IF(NRY.LT.30)NRY-30 


READ IN PRICES, EXAMPLE FOLLOWS 


DO 60 I=1,K 
READ(1,*)FIRSTY(I),LAST(I),NPRICES 
DO 30 J-1,NPRICES 
READ(1,*)IDATE,Z(J,I) 

CONTINUE 

CONTINUE 


TAKE LOGS 


DO 90 I-1,K 
IF(FIRSTY(I).LT.22)FIRSTY(I)-22 
FIRSTX(I)-FIRSTY(I) 

DO 90 J=1,LAST(I) 
Z(J,I)-ALOG(Z(J,I)) 

FIRSTX(1)-2 


CALCULATE SUMMARY STATISTICS (SECTION 2.3) BEGINNING WITH XBAR 


N=0.0 

XSUM=0.0 

DO 100 I-1,K 

N=N+FLOAT( LAST(I)-FIRSTX(I) +1) 
XSUM-XSUM*Z(LAST(I),I)-Z(FIRSTX(I)-1,I) 
XBAR-XSUM/N 

N1=N+O.5 

DO 120 I=1,K 

21-ЕІК5ТҮ(І)-20 

J2-FIRSTY(I)-1 

VHAT=0.0 

DO 110 J=J1,J2 
VHAT=VHAT+0.05*ABS(Z(J,1)-Z(J-1,1)~-XBAR) 
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FIRSTV(I)=VHAT 
DO 120 J=FIRSTX(I),LAST(I) 
XJ=Z(J,1)-Z(J-1,1) 
ABSXSUM=ABSXSUM+ABS ( XJ} 
XXSUM=XXSUM+XJ * XJ 
XJ=XJ-XBAR 
MSUM=MSUM+ABS ( XJ) 
X2SUM=X2SUM+XJ*XJ 
X3SUM=X3SUM+XJ**3 
X4SUM=X4SUM+XJ**4 
IF(J.LT.FIRSTY(I))GOTO 120 
YSUM=YSUM+ (XJ/VHAT) 
VHAT=G1 * VHAT+G *ABS (XJ) 

120 CONTINUE 
ABSXBAR=ABSXSUM/N 
MBAR=MSUM/N 
SBAR=X2SUM/N 
XXBAR=XXSUM/N 
YBAR=YSUM/ (N-20.0) 
XVAR=X2SUM/ (N-1) 
XSD-SQRT(XVAR) 
XSKEW-(X3SUM/(N-1))/(XSD**3) 
XKURT-(X4SUM/(N-1))/(XSD**4) 
XTSTAT-XBAR*SQRT(N)/XSD 


WRITE(6,130)N1,XBAR, XSD, XVAR, XTSTAT , XSKEW , XKURT 
130 FORMAT(/1X,'RETURNS',8X,'MEAN',5X, ST.DEV.',8X, 'VAR. ' , 6X, 


1'T-STAT.',7X,'SKEW.', 7X, KURT. ' ,/1X,I6,F13.6,F11.4,F13.6,3F12.2) 


GMRET-100.0*(EXP(252.0*XBAR)-1.0) 


WRITE(6,140)GMRET 


140 FORMAT(/1X,'AVERAGE X IS ABOUT EQUIVALENT TO A',F6.2, 


11% COMPOUND ANNUAL RETURN') 


АМКЕТ-100.0%(ЕХР(252.0%ХВАК%126.0%ХУАК)-1.0) 


WRITE(6,150) AMRET 


150 FORMAT(/1X,'XBAR AND XVAR SUGGEST A',F6.2, 


11% AVERAGE ANNUAL RETURN' } 


с SUMMARISE OUTLIERS (SECTION 2.7) 


DO 170 I-1,K 
DO 160 J-FIRSTX(I),LAST(I) 
XJ2Z2(J,I)-Z(J-1,I) 
STDX-(XJ-XBAR) /XSD 
IF(ABS(STDX).LT.2.0)GOTO 160 
ASTDX-ABS(STDX) 
ISTDX-ASTDX 
IF(ISTDX.GT.20) ISTDX=20 
K2=2 
IF(STDX.LT.0.0)K2=1 
DO 155 Ki=2,ISTDX 
OUTS(K1,K2)-OUTS(K1,K2)41 
155 OUTS(Ki,3)=OUTS(K1,3)+1 
160 CONTINUE 
170 CONTINUE 
WRITE(6,180) 


180 FORMAT(//1X,'OUTLIERS EXCEEDING K ST. DEVS.',/2X,'K',4X, 
1'-VE',4X,'*«VE', 4X, 'TOTAL' ,/1X) 


DO 190 I-2,20 
IF(OUTS(I,3).EQ.0)GOTO 190 


WRITE(6,185)I,(OUTS(I,J),J7721,3) 


185 FORMAT(1X,I2,3I7) 
190 CONTINUE 


(SECTIONS 2.9 & 2.10) 


оо оо 


DO 220 I=1,K 


CALCULATE AUTOCORRELATIONS OF X, АВ5(Х) & X*X 
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оооо 


JSTART=FIRSTX(I) 
JEND-LAST(I) 
DO 205 J=JSTART, JEND 
205 X(J)=Z(J,1)-Z(J-1,1)-XBAR 
CALL ACF(X,XSIZE,VX,CX,RX,CSIZE,NRX, JSTART, JEND) 
DO 210 J=JSTART, JEND 
210 X(J)=ABS(X(J)+XBAR)-ABSXBAR 
CALL ACF(X,XSIZE,VABS,CABS,RABS,CSIZE,NRX,JSTART, JEND) 
DO 215 J=JSTART, JEND 
215 X(J)=((X(J)+ABSXBAR) **2) -XXBAR 
CALL ACF(X,XSIZE,VSQD,CSQD,RSQD,CSIZE,NRX, JSTART, JEND) 
220 CONTINUE 
SE-1.0/SQRT(N) 
CIL--1.96*SE 
CIU=1.96*SE 
WRITE(6,225)N1 
IF(K.GT.1)WRITE(6,226)K 
225 FORMAT(//1X,'AUTOCORRELATION COEFFICIENTS CALCULATED FROM', 
1I6,' OBSERVATIONS') 
226 FORMAT(1X,'AND',I3,' FUTURES') 
WRITE(6,227) 
227 FORMAT(/1X,'LAG',6X,'RETURNS',7X,'ABS. RETURNS',5X, 
1'SQD. RETURNS') 
DO 240 I=1,NRX 
WRITE(6,230)I,RX(I),RABS(I),RSQD(I) 
230 FORMAT(1X,I3,F13.4,F16.4,F17.4) 
240 CONTINUE 
WRITE(6,245)SE,CIL,CIU 
245 FORMAT(/1X,'STANDARD ERROR',F8.4,' 95% CONFIDENCE LIMIT', 
1F8.4,' TO',F8.4,' FOR STRICT W.N.') 


ESTIMATE PARAMETERS FOR THE STANDARD DEVIATION PROCESS 
(SECTION 3.8) AND THE SPECIAL CASE LOGNORMAL AR(1) (SECTION 3.9) 


ESTMUV-MBAR/DELTA 

ESTSIGV-SQRT(SBAR-ESTMUV**2) 

ALPHAH=ALOG ( MBAR *MBAR/ (DELTA*DELTA*SQRT(SBAR) )) 

BETAH=SQRT { ALOG( DELTA* DELTA*SBAR/ (MBAR*MBAR) ) ) 
VMEDIAN-EXP(ALPHAH) 

VCIL-EXP(ALPHAH-1.96*BETAH) 

VCIU=EXP (ALPHAH+1.96*BETAH) 
WRITE(6,250)DELTA,VMEDIAN,ESTMUV,ESTSIGV,VCIL,VCIU,ALPHAH, 


1BETAH 

250 FORMAT(///1X,'PARAMETER ESTIMATES FOR THE PROCESS V OF CONDITIONAL 
1 STANDARD DEVIATIONS, USING DELTA =',F7.3,//1X,'FOR V, MEDIAN =', 
2F8.4,' MEAN -',F8.4,' ST. DEV. =',F8.4,' 95% RANGE FROM', 
3F8.4,' TO',F8.4,/1X,'FOR LOG(V), MEAN (ALPHA) -',F7.3, 
4' ST. DEV. (BETA) =!,F7.3) 


DO 260 I=1,K 
JSTART-FIRSTX(I) 
JEND-LAST(I) 
DO 255 J=JSTART, JEND 
255 X(J)-ABS(Z(J,I)-Z(J-1,1I) -XBAR) -MBAR 
CALL ACF(X,XSIZE,VM,CM,RM,CSIZE,NRM, JSTART, JEND) 
260 CONTINUE 
DO 262 I-1,NRM 
262 RM(I)-RM(I)*N/(N-K*I) 
CALL FITV(RM,CSIZE,NRM,N,ESTB, PHIH,FMIN) 
BTEST-2.0-3.141592*ESTB 
IF(BTEST.LT.O.O)WRITE(6,264) 
264 FORMAT(/1X,'ESTIMATED B IS INFEASIBLE') 
IF(BTEST.LT.O.O)BETAMH-O.O 
IF(BTEST.GT.O.0) BETAMH-SQRT(ALOG(2.0*(1.0-ESTB) /BTEST)) 
CALL MAPARA(ESTB,PHIH,THETAH) 
WRITE(6,265)FMIN,ESTB,PHIH, THETAH, BETAMH 
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осоо 


265 


268 
270 


280 


285 
290 


295 


300 


305 


310 
320 


330 


340 


FORMAT(/1X,'MINIMUM VALUE OF F6 IS',F6.2,' GIVEN BY B -', 

1Е8.4,' AND PHI =',F7.3,/1X,'THESE IMPLY THETA FOR FORECASTING IS 
2',F7.3,' AND ANOTHER BETA ESTIMATE IS',F7.3,//1X, 
S'AUTOCORRELATIONS OF ADJUSTED ABSOLUTE RETURNS M',/1X,'LAG', 
4ATX,'ACTUAL',7X,'FITTED') 

DO 270 I-1,NRM 

FITTED-ESTB* (PHIH**I) 

WRITE(6,268)I,RM(I),FITTED 

FORMAT(1X,13,2F13.4) 

CONTINUE 


ESTIMATE VARIANCES OF AUTOCORRELATIONS (SECTIONS 5.7 & 5.8) 


DO 295 I-1,K 
JSTART=FIRSTY (I) 

JX=FIRSTX (I) 

JEND-LAST(I) 

VHAT-FIRSTV(I) 

DO 280 J-JSTART,JEND 

X(J)2Z2(J, I) -Z(2-1,I) 

¥(J)=(X(J)-XBAR) /VHAT 

VHAT-G1*VHAT-«G*ABS(X(J) -XBAR) 

IF(I.GT.1)GOTO 290 

JS1-JSTART-1 

DO 285 J-JX,JS1 

Х(2)-2(2,1)-2(2-1,1) 

CALL VARR(X,XBAR,XSIZE,B,BSIZE,DENX,NVARR,JX,JEND,SQUARES) 
CALL VARR(Y,YBAR,XSIZE,BSTAR,BSIZE,DENY,NVARR,JSTART,JEND,SQUARES) 
CONTINUE 

X1-N/(DENX*DENX) 

Y1-(N-20.0)/(DENY*DENY) 

DO 300 I-1,NVARR 

B(I)-B(I)*X1 

BSTAR(I)-BSTAR(I)*Y1 

WRITE(6,305)G 

FORMAT(//1X,'ESTIMATES OF THE VARIANCES OF THE AUTOCORRELATION COE 
1FFICIENTS (* SERIES LENGTH)',/4X,'X-DATA ARE RETURNS', 
2/4X,'Y-DATA ARE RETURNS RESCALED USING GAMMA -',F6.2, 
3//1X,'LAG',4X, 'X-DATA' , 4X, ' Y-DATA' , /1X) 

DO 320 I-1,NVARR 

WRITE(6,310)I,B(I),BSTAR(I) 

FORMAT (1X, I3,2F10.3) 

CONTINUE 


CALCULATE AUTOCORRELATIONS OF RESCALED RETURNS AND ASSORTED 
RANDOM WALK TEST STATISTICS (SECTIONS 6.4, 6.6, 6.7 & 6.9) 


DO 340 I=1,K 
JS-FIRSTX(I) 
JSTART-FIRSTY(I) 
JEND-LAST(I) 
VHAT-FIRSTV(I) 

DO 330 J=JS,JEND 
X(J)=2(J,1)-Z(J-1,1) 
IF(J.LT.JSTART)GOTO 330 


Y(J)2((X(J) -KBAR) /VHAT) -YBAR 
VHAT=G1* VHAT+G* ABS (Х(Ј) -ХВАК) 
CONTINUE 


CALL ACF(Y,XSIZE,VY,CY,RY,CSIZE,NRY,JSTART, JEND) 


CALL RUNSTEST(X,XSIZE,TYPE,RUNS,ER,SR,RUNT, JS, JEND, MINDIF) 
CONTINUE 


NY=N-20.0 

NY1=N1-20 
WRITE(6,225)NY1 
IF(K.GT.1)WRITE(6,226)K 
WRITE(6,350) 
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350 FORMAT(/1X,'LAG',6X,'FOR RESCALED RETURNS') 
DO 370 I-1,30 
WRITE(6,360)I,RY(I) 
360 FORMAT(1X,I3,F13.4) 
370 CONTINUE 
SNX-SQRT(N) 
SNY-SQRT (NY) 
CIUY-1.96/SNY 
NSIGRY=0 
QSUM=0.0 
R1TEST=RY (1) *SNY 
DO 380 I=1,50 
QSUM=QSUM+NY*RY(I) *RY(I) 
IF(I.EQ.10)Q10-QSUM 
IF(I.EQ.30)Q30-QSUM 
IF(I.EQ.50)Q50-QSUM 
IF(I.GT.30)GOTO 380 
TX=TX+(0.92**I) *RX(I) 
TY=TY+(0.92**I)*RY(I) 
IF((I.LE.28).AND.(ABS(RY(I)).GT.CIUY))NSIGRY-NSIGRY*1 
380 CONTINUE 
UX-0.4649*SNX* (TX-0.92*RX(1)) 
UY-0.4649*SNY*(TY-0.92*RY(1)) 
TX-0.4274*SNX* TX 
TY-0.4274*SNY*TY 
WRITE(6,420)TY,UY,RITEST,NSIGRY,N28,Q10,030,Q50 
420 FORMAT(///1X,'VALUES OF THE AUTOCORRELATION TEST STATISTICS', 


1//3X,'T =',F6.2,', U =',F6.2,/3X,'R(1) =',F6.2,' SE UNITS', 
2/3X,'NUMBER OF SIGNIFICANT R(TAU) AT THE 5% LEVEL',I4,' OUT ОҒ", 
314,/3X,'Q10 -',F7.2,', 030 -!,Е7.2,!, Q50 =',F7.2) 
WRITE(6,430) TX,UX 

430 FORMAT(/1X,'USING RETURNS WOULD GIVE : Т =',F6.2,' AND U =',F6.2) 


WRITE(6,440) ( TYPE( I), 121,3) , RUNS, ER, SR, RUNT 
440 FORMAT(//1X,'RUNS ANALYSIS',/1X,F7.1,' DOWNS',/1X,F7.1,' UPS', 

1/1X,F7.1,' NO CHANGES',//1X,F7.1,' RUNS OBSERVED',/1X,F7.1, 

2' RUNS EXPECTED',/1X,F7.1,' S.D.',//AX,'RUNS TEST STATISTIC =', 

3F6.2) 

NRSP1=NRSP+1 

CALL SPECTRAL(RY,CSIZE,SPEC,SPECSD,SPECD,SSIZE,NRSP1,NY,NSIGFJ) 

J=NRSP-1 

WRITE(6,460)J,NRSP 
460 FORMAT(///1X,'SPECTRAL DENSITY ESTIMATED FROM FOURIER TRANSFORM OF 

1',/1X,'THE FIRST',I4,' AUTOCORRELATIONS.  PARZEN WINDOW USED.', 

2/1X,'ESTIMATED AT FREQUENCIES PI/',I4,' APART.',//1X,'FREQ/PI', 

32X,'PERIOD',3X,'DENSITY' , 5X, 'SD' , 5X, 'STANDARDISED') 

DO 500 I-1,NRSP1,4 

11=1-1 

FR=FLOAT(I1) /FLOAT(NRSP ) 

IF(I.GT.1)GOTO 480 

WRITE(6,470)FR,SPEC(1),SPECSD(1), SPECD(1) 
470 FORMAT(1X,F6.3,6X,'INF',F9.3,2F10.3) 

GOTO 500 
480 PER=2.0/FR 

WRITE(6,490)FR,PER,SPEC(I),SPECSD(I),SPECD(I) 
490 FORMAT(1X,F6.3,F9.2,F9.3,2F10.3) 


500 CONTINUE 
IW=1+(4*NRSP)/10 
J1=1+NRSP /4 
WRITE(6,520)SPECD(1),SPECD(IW),NSIGFJ,J1 

520 FORMAT(//1X,'SPECTRAL TEST STATISTICS',/3X,'ZERO FREQ.',F8.3, 
1/3X,'WEEK CYCLE',F8.3,/3X,'NUMBER OF SIGNIFICANT F(J) AT THE 5% LE 
2VEL',I4,' OUT OF',14) 


с ESTIMATE TREND MODEL PARAMETERS (SECTION 7.3) 


MMAX-40 
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ї1=1 

DO 610 I-I1,NRFIT 

RX(I)2N*RX(I)/(N-K*I) 
610 RY(I)sNY*RY(I)/(NY-K*I) 

CALL FIT(RX,CSIZE,I1,NRFIT,AEST1,SMIN1,ASIZE,MMAX,N,MHATX, AHATX) 

CALL FIT(RY,CSIZE,I1,NRFIT,AEST2,SMIN2,ASIZE,MMAX,NY,MHATY , AHATY) 

WRITE(6,620)11,NRFIT 
620 FORMAT(///1X,'TREND PARAMETERS ESTIMATED FROM AUTOCORRELATIONS', 

113,' TO',I3,/AX,'FIRSTLY FROM RETURNS, SECONDLY FROM RESCALED КЕТО 

2RNS',//4X,'M',5X,'P',10X,'A1',9X,'S1',10X,'A2',9X,'S2') 

DO 640 I=1,MMAX 

P=1.0-1.0/FLOAT(I) 

WRITE(6,630)I,P,AEST1(I), SMIN1(I),AEST2(I) , SMIN2(I) 
630 FORMAT(1X,I4,F9.4,2(F11.4,F11.2)) 
640 CONTINUE 

PHATX=1.0-1.0/FLOAT (MHATX) 

CALL MAPARA( AHATX, PHATX, QHATX) 

PHATY-1.0-1.0/FLOAT(MHATY) 

CALL MAPARA(AHATY,PHATY,QHATY) 

WRITE(6,650)MHATX,AHATX,PHATX,QHATX,MHATY , AHATY , PHATY , QHATY 


650 FORMAT(/1X,'MINIMISING F1 GIVES : M =',I3,', A =',F7.4, 
1', P -!,Е7.4,! AND Q =',F7.4,/1X,'MINIMISING F2 GIVES : M -', 
213,', A =',F7.4,', P =',F7.4,' AND Q =',ЕТ.4) 
STOP 
END 


SUBROUTINE ACF(X,N,V,C,R,M,NR,J1,J2) 
REAL X(N),C(M),R(M) 
DO 100 J=J1,J32 
100 V=V+X(J)*X(J) 
DO 400 I-1,NR 


JEND-J2-I 
IF(JEND.LT.J1)GOTO 300 
K=J1+I-1 

DO 200 J=J1,JEND 

K=K+1 


C(I)=C(I)+X(J)*X(K) 
200 CONTINUE 
300 R(I)=C(I)/V 
400 CONTINUE 

RETURN 

END 


SUBROUTINE FIT(R,NR,I1,I2,A,S,NS,MMAX,N,MHAT,AHAT) 
REAL N,R(NR),A(NS),S(NS) 
A(1)=0.0 
$(1)=0.0 
DO 100 I-I1,I2 
100 S(1)2S(1)*N*R(I)*R(I) 
SMIN-S(1) 
MHAT-1 
DO 300 I-2,MMAX 
P=1.0-1.0/FLOAT(I) 
A(I)=0.0 
D=0.0 
DO 200 J-I1,I2 
A(I)-A(I)*R(J)*(P**J) 
200 D-D4P**(J4J) 
A(I)SA(I)/D 
IF(A(I).LT.O.O)A(I)-0.0 
S(I)-S(1)-N*A(I)*A(I)*D 
IF(S(I).GT.SMIN)GOTO 300 
IF(A(I).LT.0.000001)GOTO 300 
SMIN-S(I) 
AHAT-A(I) 
MHAT-I 
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300 CONTINUE 
RETURN 
END 


SUBROUTINE FITV(R,NR,J1,N,BHAT,PHIHAT,FMIN) 
REAL N,R(NR),PHI(226) 
DO 10 I=1,51 

10 PHI(I)=0.01*FLOAT(I-1) 

DO 20 I-52,131 

20 PHI(I)=0.5+0.005*FLOAT(I-51) 
DO 30 I-132,226 

30 PHI(I)=0.9+0.001*FLOAT(I-131) 
FMIN-100.0*N 
BESTB-0.0 
DO 200 I-1,226 
IF(I.EQ.1)GOTO 120 
S-0.0 
SP=0.0 
DO 100 J=1,J1 
P=PHI(I)**J 
S=S+R(J)*P 

100 SP=SP+P*P 
BESTB=S/SP 

120 F=0.0 
DO 150 2-1,71 

150 F=F+N*((R(J)-BESTB*(PHI(I)**J))**2) 
IF(F.GT.FMIN)GOTO 200 
BHAT=BESTB 
FMIN=F 
PHIHAT=PHI(I) 

200 CONTINUE 
RETURN 
END 


SUBROUTINE MAPARA(A,P,Q) 
Bz(1.0*P*P*(1.0-2.0*A) )/(P*(1.0-A)) 
Q=0.5* (B-SQRT(B*B-4.0)) 

RETURN 

END 


SUBROUTINE RUNSTEST(X,N,S,RUNS, ER, SIGR, K, N1, N2, E2) 
REAL К,5(3),Т(3),Х(М) 
Е1=-Е2 
RUNS=RUNS+1.0 
DO 200 I-N1,N2 
J23 
IF(X(I).LT.E1)J-1 
IF(X(I).GT.E2)J22 
5(7)-5(7)%1.0 
IF(I.EQ.N1)GOTO 100 
IF(J.NE.J1)RUNS=RUNS+1.O 

100 21-2 

200 CONTINUE 
DO 400 I=1,3 
Т(І)-0.0 
DO 300 2-1,3 

300 Т(І)-Т(1)%5(7)%“І 

400 CONTINUE 
ER=T(1)+1.0-T(2)/T(1) 
A=T(2)*(T(2)+(T(1)*(T(1)+1.0)))-2.0*T(1)*T(3)-(T(1) **3) 
SIGR=SQRT(A/((T(1)**3)-T(1))) 
K=(RUNS-ER) /SIGR 
RETURN 
END 
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100 


200 


400 


100 


200 
300 


SUBROUTINE SPECTRAL(R,NR,SPEC,SPECSD,SPECD,MS,M1,N,NSIG) 
REAL N,R(NR),SPEC(MS), ЅРЕСЅр (М5) , SPECD(MS) , LAMBDA( 100) 
M-M1-1 

MM1-M-1 

FM-FLOAT(M) 

DO 100 Із1,М 

A-FLOAT(I)/FM 
LAMBDA(I)721.0-6.0*A*A*6.0*A*A*A 
IF(A.GT.0.5)LAMBDA(I)-2.0*((1.0-A)**3) 
CONTINUE 

NSIG=0 

PI=3.141592 

TWOPI=PI+PI 

DO 400 J-1,M1 

OMEGA-PI*FLOAT(J-1)/FM 

OMK=0.0 

SPEC(J)=1.0 

SPECSD(J)=0.0 

DO 200 K=1,MM1 

OMK-OMK«OMEGA 
IF(OMK.GT.PI)OMK-OMK-TWOPI 
CL-2COS(OMK)*LAMBDA(K) 

SPEC (J)=SPEC(J)+2.0*CL*R(K) 
SPECSD(J}=SPECSD(J)+4.0*CL*CL 
CONTINUE 

SPECSD(J)=SQRT(SPECSD(J)/N) 
SPECD(J)=(SPEC(J)-1.0)/SPECSD(J) 
J1=J-1 

IF(J1.GT.(4*(J1/4)))GOTO 400 
IF(ABS(SPECD(J)).GT.1.96)NSIG=NSIG+1 
CONTINUE 

RETURN 

END 


SUBROUTINE VARR(X,XBAR,N,VR,M,D,NV,J1,J2,A) 
REAL X(N),VR(M),A(N) 
DO 100 J-J1,J2 
A(J)-(X(J) -KBAR)**2 
D=D+A(J) 

DO 300 I=1,NV 
JEND-J2-I 
IF(JEND.LT.J1)GOTO 300 
K=J1+I-1 

DO 200 J=J1,JEND 

K=K+1 
VR(I)*VR(I)*A(J)*A(K) 
CONTINUE 

CONTINUE 

RETURN 

END 
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